RNA polymerase I transcription in vivo in transiently DNA-transfected cells has been used for expression of influenza vRNA molecules
coding for chloramphenicol acetyltransferase (CAT) in antisense orientation. Influenza virus superinfection served to provide viral RNA
polymerase and other proteins for transcriptional conversion of minus-strand vRNA into plus-strand viral mRNA molecules expressing
CAT activity. This system has been used for an analysis, via nucleotide exchanges as well as deletions and insertions, of both terminal
segments of the vRNA sequence, which cooperatively constitute the vRNA promoter structure. Several mutants with greatly enhanced
expression rates over wild-type levels have been constructed, which can also be packaged and serially passaged into progeny virus. The
data obtained for the mutations in various promoter elements support a model of consecutive, double-strand vRNA promoter structures in
binding of viral polymerase and initiation of RNA synthesis. Preparations of attenuated influenza virus for vaccination purposes include a
single recombinant segment with promoter-up mutation(s) for over-expression of an own or foreign gene product, which at the same time,
because of its over-replication, serves to decrease the number of helper virus RNP segments. The same viruses have further been passaged
through a step of ribozyme cleavage acting at one of the helper viral segments, which will delete this vital function and structure with high
rates from the virus progeny. The resulting attenuated viruses will interact with their target cells in only one round of abortive infection,
and are unable to produce viral progeny.



WO 96/10641



AN ATTENUATED VACCINATION AND GENE-TRANSFER VIRUS, A
METHOD TO MAKE THE VIRUS AND A PHARMACEUTICAL COMPOSITION
COMPRISING THE VIRUS

The object of the present invention was to make a vaccination virus. This
objective has been fulfilled with the segmented virus constructed as described
herein.

The genome of influenza A viruses consists of 8 different single-stranded viral
RNA (vRNA) molecules of negative polarity, which have in common 5' and 3'
terminal sequences largely complementary to each other. These conserved
segments, 13 and 12 nucleotides in length, are known to form double-stranded RNA
panhandle structures (Hsu et al., 1987; Fodor et al., 1993), which have been
analysed in more detail recently in vitro using internally deleted model RNAs
(Baudin et al., 1994; Tiley et al., 1994). In the virion the panhandle ends of all
RNA segments are found in specific binding to viral RNA polymerase complexes,
while the remaining internal segments stay single-stranded with viral nucleoprotein
(NP) in cooperative binding (Compans et al., 1972; Honda et al., 1988; Martin et
al., 1992). Upon infection these viral RNPs initially serve as templates for the
synthesis of viral mRNAs by a specific cap-snatching mechanism (Plotch et al.,
1979; Braam et al., 1983), and later on will direct synthesis of full-length
complementary RNAs (cRNAs), probably dependent on the absence or presence of
newly synthesized NP protein (Shapiro and Krug, 1988). The plus-strand cRNAs
are then used as templates for progeny vRNA synthesis.

The viral RNA polymerase complex consisting of proteins PB1, PB2, and PA is
involved in all three different modes of RNA synthesis during the viral replication
cycle, following its specific binding to the terminal panhandle segments of both
vRNAs and cRNAs. Sequence comparison reveals that the vRNA and cRNA
termini have similar, but not identical sequences. For that reason vRNA and
cRNA recognition may be distinguished because of these structural alterations,
allowing for asymmetries in initiation of plus and minus strand RNA synthesis,
and possibly in viral RNP packaging, which has also been suggested to be
controlled by the panhandle RNA sequence (Hsu et al., 1987).

Recently, we reported on an in vivo system for the introduction of specific
mutations into the genome of influenza viruses: viral cDNA has been inserted in
antisense orientation between mouse rDNA promoter and terminator sequences.
This has been derived from in vitro transcription experiments based on nuclear
extracts from Ehrlich ascites cells, which resulted in transcripts exactly resembling
influenza vRNA. For a series of in vivo studies, the viral coding sequence was
replaced by the coding sequence for chloramphenicol acetyltransferase (CAT),
with both influenza terminal non-coding sequences, however, being retained
exactly on the resulting vRNA transcripts. After transfection of this recombinant
DNA template into mouse cells followed by influenza virus infection, CAT
activity was detectable. Transfer of supernatants to different cells demonstrated
that CAT-vRNAs transcribed in vivo by cellular RNA polymerase I were not only
transcribed by viral RNA polymerase into plus-strand mRNA and translated into
CAT protein, but also were replicated and packaged into infectious progeny virus
particles (Zobel et al., 1993; Neumann et al., 1994).
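
The antisense arrangement described above can be made concrete with a minimal sketch: a negative-sense (vRNA-like) transcript is the reverse complement of the plus-sense coding sequence. The toy sequence and the function name below are illustrative assumptions and are not taken from the constructs described here.

```python
# Minimal sketch: a minus-strand (vRNA-like) RNA is the reverse complement
# of the plus-strand (mRNA-like) sequence. The toy sequence is hypothetical.

COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna: str) -> str:
    """Return the reverse complement of an RNA sequence written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(rna.upper()))


plus_strand = "AUGGAGAAAAAAAUC"  # hypothetical plus-sense fragment, 5'->3'
minus_strand = reverse_complement(plus_strand)
print(minus_strand)
```

Applying the function twice returns the original sequence, which mirrors the vRNA to cRNA to vRNA copying cycle at the level of sequence only.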

We have used this system for a stepwise introduction of single and multiple
mutations into the conserved panhandle RNA sequence, thereby effectively
converting the HA-vRNA promoter sequence into an HA-cRNA promoter
sequence and vice versa. For these series of constructs CAT activities have been
measured both in primarily transfected and infected B82 cells and, after passaging
of B82 supernatants, in secondarily infected MDCK cells. From the results
obtained we propose a model for the terminal RNA sequence as being recognized
by RNA polymerase in consecutive steps of different structure when used as a
template for initiation of viral mRNA synthesis.

The present invention relates to a segmented RNA virus which comprises one or
more segments which have been genetically modified to show improved
transcription, replication and/or expression rates.

The virus can be one where one or more modifications have been introduced in
the noncoding region(s) and/or one or more modifications have been introduced in
the coding region(s). A possibility is that at least one modified segment is derived
from an original one by sequence variation(s). It is also possible that at least one
modified segment is an artificial addition to the set of original or modified original
segments. The virus can be one wherein the modified segment comprises a
nucleotide sequence which codes for a protein or peptide which is foreign to the
original virus. Preferred is that the foreign protein or peptide constitutes an antigen
or antigen-like sequence, a T-cell epitope or related sequence. In such a case it is
possible that the segment comprises repetitions of an antigen or epitope or other.
Such an antigen or epitope can be for example derived from HIV, Herpes virus,
Human Papilloma virus, Rhinovirus, CMV or Hog Cholera Virus (HCV). The
virus of the present invention may be a single-stranded negative-strand RNA virus,
as for example one of the Orthomyxoviridae family, the Bunyaviridae family or
the Arenaviridae family. The most preferred virus is an influenza virus. The virus
of the present invention can also be a double-stranded RNA virus, as for example
a reovirus, a rotavirus or an orbivirus. The viruses of the present invention can be
used in gene therapy.

The present invention also relates to the virus and use of the virus for the
preparation of pharmaceuticals.

Mutational analysis of vRNA 3' terminal sequence positions

Influenza A viral RNA 5' and 3' ends have similar, but not identical sequences,
with nucleotide mismatches at positions 3, 5 and 8, and an additional unpaired
nucleotide located at position 10 in the 5' region. Nevertheless, both vRNA
termini hybridize into a double-stranded panhandle structure made up of twelve
and thirteen nucleotides in common for all eight RNA segments, plus on average
three additional basepairs specific for each of the vRNA molecules. Due to the
deviations mentioned, the cRNA or plus-strand panhandle structures have to be
different from the vRNA structures; however, both are recognized by viral RNA
polymerase and are used for initiation of RNA synthesis, i.e. they constitute
a promoter structure. Even if in initial recognition and binding of RNA
polymerase the double-stranded RNA panhandle structure is known to be the
substrate, and is also observed in virion RNPs (Hsu et al., 1987), for the initiation
step of transcription at the 3' ultimate template position this terminal region has to
be separated into a partially single-stranded, i.e. 'forked' structure (Fodor et al.,
1994). RNA polymerase may be predicted to continue its binding interaction with
both the remaining double-stranded segment (nucleotides 1̄0̄ to 1̄5̄ versus 11 to
16*) and the single-stranded 3' template segment (nucleotides 1̄ up to 9̄), as well
as the 5' single-stranded end (Tiley et al., 1994). Introduction of mutations at
specific positions in either strand may hence alter simultaneously both of these
consecutive vRNA promoter structures, panhandle and fork, in different ways, and
will in addition also result in corresponding variations of the cRNA promoter
structure.
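
The pairing scheme just described can be checked with a short sketch. The terminal sequences used below are the consensus influenza A termini as commonly given in the general literature; they are an assumption here, since this text does not quote them, and G·U wobble pairs are deliberately counted as mismatches. The function name and the bulge parameter are likewise illustrative.

```python
# Sketch: locate mismatches in the vRNA panhandle, assuming consensus termini.
# ASSUMPTION: both sequences come from the general literature, not this text.
FIVE_PRIME = "AGUAGAAACAAGG"   # 5' end, positions 1..13, written 5'->3'
THREE_PRIME = "UCGUUUUCGUCC"   # 3' end, positions 1..12, counted from the 3' terminus

WATSON_CRICK = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}


def panhandle_pairs(five: str, three: str, bulge_pos: int = 10):
    """Pair 3' position n with 5' position n before the bulge, and with
    5' position n + 1 from the bulge onwards (5' residue bulge_pos is unpaired)."""
    pairs = []
    for pos3 in range(1, len(three) + 1):
        pos5 = pos3 if pos3 < bulge_pos else pos3 + 1
        if pos5 > len(five):
            break
        b5, b3 = five[pos5 - 1], three[pos3 - 1]
        pairs.append((pos5, pos3, b5, b3, (b5, b3) in WATSON_CRICK))
    return pairs


pairs = panhandle_pairs(FIVE_PRIME, THREE_PRIME)
mismatches = [pos3 for (_, pos3, _, _, ok) in pairs if not ok]
print(mismatches)
```

Under these assumptions the only non-Watson-Crick positions are 3, 5 and 8, and the 5' residue at position 10 is skipped as the unpaired nucleotide, in line with the description above.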

To investigate the importance of the three mismatch positions, specific single,
double or triple nucleotide exchanges were first introduced into the vRNA 3' end
sequence at positions 3̄, 5̄ and 8̄, thereby approaching a fully double-stranded
vRNA promoter structure in a step-wise manner. At the same time the vRNA 3'
end template sequence will become equivalent to the cRNA 3' end in these
positions, but not in regard to the additional nucleotide at position 10. Single
nucleotide exchanges according to this scheme (pHL1098, pHL1099, pHL1100)
abolished the promoter activity, and no CAT activity was observed, as has been
reported before with a different method (Luo et al., 1993). Two of the double
mutation constructs (pHL1101, pHL1103) also gave negative results.

In contrast, for pHL1102 (G3̄A, U8̄C)*, a significant CAT activity was detected,
distinctly higher than for the corresponding wild-type construct (pHL926), which
in the conditions applied (8 hr after infection) resulted in rather low levels of CAT
expression. This activity increase is further enhanced for the final construct of this
series carrying the triple exchange G3̄A, C5̄U and U8̄C (pHL1104), i.e.
transfection of pHL1104 DNA followed by influenza virus infection resulted in a very
high level of CAT expression, also considerably above the pHL1102 results.

These results have been repeated using various conditions of transfection and
infection as well as determining kinetic data during the course of infection. While
the pHL1104 variant is always observed to be far superior to any wild-type
construct, that expression ratio may be variable and difficult to quantitate (between
around 20-fold and nearly 100-fold). Rather short infective cycles of eight hours as
used prevalently appear to put more slowly replicating, i.e. wild-type, molecules at
a disadvantage, in particular in passaging of packaged pseudo-vRNA molecules via
virus progeny; both are found increased for wild-type and related constructs after
DNA transfection plus twelve hours of infection (see Neumann et al., 1994).
Remaining deviations in CAT expression ratios may be attributed to variations in
growth conditions in individual experiments.

* Notations concerning nucleotides 1̄ to 1̄5̄ refer to positions in the vRNA 3' end,
e.g. position 2̄ designates the penultimate nucleotide; 5' end positions are given in
ordinary numbers. The notation G3̄A describes a mutational change of guanosine
to adenosine at position 3̄.

Mutational analysis of vRNA 5' terminal sequence positions

We also addressed the question whether the unexpectedly high viral mRNA
expression rate of pHL1104 is the consequence of a stabilized panhandle
double-strand structure or may be directly attributed to the point mutations introduced into
the vRNA 3' sequence, and active when being used as a single-stranded template
segment, e.g. in the 'forked' structure.

For this purpose we constructed pHL1124, with three complementary point mutations
introduced at the 5' end of the vRNA sequence, again in positions 3, 5 and 8
(U3C, G5A, A8G). Together with a wild-type vRNA 3' end sequence these
variations again result in a panhandle structure free of mismatches and, therefore,
pHL1124 is equivalent in this regard to pHL1104, but different in the sequence of
its template and non-template single strands. No significant CAT expression was
detected for pHL1124. We conclude that the increased CAT activity of pHL1104
is not a consequence of the stabilized panhandle structure itself, but at least in part
is a consequence of the individual nucleotide exchanges at positions 3̄, 5̄ and 8̄ at
the 3' end of the vRNA sequence; it is also more likely then to originate from
other structural intermediates of initiation than a stabilized panhandle.
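
As an illustration of the exchange notation, the following sketch applies the three 5' exchanges of pHL1124 (U3C, G5A, A8G) to an assumed consensus 5' end and checks that positions 1 to 9 then pair with an assumed wild-type 3' end without mismatches. Both terminal sequences are taken from the general influenza A literature rather than from this text, and the helper name `apply_exchanges` is hypothetical.

```python
import re

# ASSUMPTION: consensus termini from the general literature, not from this text.
FIVE_PRIME_WT = "AGUAGAAACAAGG"   # 5' end, positions 1..13
THREE_PRIME_WT = "UCGUUUUCGUCC"   # 3' end, positions 1..12 from the 3' terminus

WATSON_CRICK = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}


def apply_exchanges(seq: str, exchanges: list) -> str:
    """Apply exchanges written as <old base><position><new base>, e.g. 'G5A'."""
    bases = list(seq)
    for exchange in exchanges:
        match = re.fullmatch(r"([ACGU])(\d+)([ACGU])", exchange)
        if match is None:
            raise ValueError(f"unrecognised exchange: {exchange}")
        old, pos, new = match.group(1), int(match.group(2)), match.group(3)
        if bases[pos - 1] != old:
            raise ValueError(f"{exchange}: position {pos} is {bases[pos - 1]}, not {old}")
        bases[pos - 1] = new
    return "".join(bases)


mutant_five_prime = apply_exchanges(FIVE_PRIME_WT, ["U3C", "G5A", "A8G"])
fully_paired = all(
    (b5, b3) in WATSON_CRICK
    for b5, b3 in zip(mutant_five_prime[:9], THREE_PRIME_WT[:9])
)
print(mutant_five_prime, fully_paired)
```

The check on the original base guards against applying an exchange to the wrong strand or position; only sequence complementarity is tested, not promoter activity.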

Mutational analyses of concerted exchanges at both ends of the vRNA 
sequence 

In order to determine in detail the influence of single, double and triple exchanges
at the vRNA 5' end upon CAT expression rates we also used the improved vRNA
3' end sequences of pHL1104 and pHL1102 as starting points rather than the
corresponding wild-type sequence. From the series of experiments related to
pHL1104 and from the equivalent series related to pHL1102 it can be concluded
that retaining a G residue in position 5 is the most important single feature in
these 5' end variations. A single exchange into an A residue at position 5, as in
pHL1155, will render the promoter entirely inactive, while single exchanges in
positions 3 or 8, as well as a 3 plus 8 double exchange, will retain promoter
activity, even if reduced from the level observed for pHL1104, but still above
wild-type expression rates. While the G5A nucleotide substitution opposite
nucleotide C5̄ in the 3' terminus results in losing one basepair (in the panhandle
context) within the pHL1104 series, a basepair is indeed gained by exactly the
same G5A exchange within the pHL1102 series, i.e. opposite the U5̄ residue as
present in the pHL1102 vRNA 3' end. Since again the G5A exchange results in
loss of promoter function in spite of gaining one basepair, we conclude that the
guanosine at position 5 may be important for RNA polymerase binding within the
5' non-template single strand rather than being part of the panhandle
double-stranded structure in this region. The importance of a G residue at this position
has been shown earlier in a single-step mutational analysis (Li and Palese, 1992),
while non-template strand binding of RNA polymerase has been studied recently
in vitro (Tiley et al., 1994). Different from the deleterious effect of an exchange at
position 5, exchanges at positions 3 and in particular 8 are of minor importance.
The series of 5' nucleotide exchanges has also been repeated for the pHL1102
version of the vRNA 3' end, yielding exactly the same pattern of results, albeit at
the somewhat reduced levels characteristic for pHL1102. The only result in both
series not quite in agreement with a uniquely important role for a G residue in
position 5 is the triple exchange of pHL1126, which retains low promoter activity
in spite of an A residue in that position. Due to altogether six concerted exchanges
in positions 3, 5 and 8 as well as 3̄, 5̄ and 8̄ from the 5' and 3' end of the
vRNA sequence, the pHL1126 vRNA panhandle structure is indeed nearly
equivalent to a wild-type cRNA panhandle, with the exception of an unpaired
adenosine being present in position 10 of pHL1126 while an unpaired uridine at
position 10 is part of the wild-type cRNA structure. This correlation may indicate
a correct structure in pHL1126 for several other residues of (minor) importance
which, therefore, apparently allows compensation for the missing G residue in
position 5, even if at a clearly reduced level of activity. In the parallel pHL1102
series the corresponding triple exchange clone pHL1125 does not show any
promoter activity; however, because of its deviation at position 5̄ it does not
completely resemble the cRNA panhandle structure.

Mutational analysis of the panhandle bulge structure around nucleotide 10

An extra, unpaired residue in position 10 at the 5' end is a specific feature of the
influenza viral RNA panhandle structure. It is causing or at least enforcing a major
bulge of the structure, together with unpaired residues at position 9, and might be
part of a specific recognition element of that structure by viral RNA polymerase.
In order to investigate the importance of that particular structural feature, a further
series of plasmid constructs has been initiated, again based on pHL1104 and its 3'
terminal sequence as a reference. A perfectly matched RNA double-strand without
any bulge has been achieved either by inserting an additional U residue in the 3'
end sequence opposite A10 (pHL1140) or by deleting the A10 residue from the 5'
sequence (pHL1152). Finally, a bulge of opposite direction was created in the
panhandle structure of pHL1164 with an extra U residue in position 1̄0̄ of the
3' end, and position 10 deleted from the 5' end sequence. While the latter two
constructs proved inactive in the CAT assay, pHL1140 did show some promoter
activity, albeit at a reduced level. We conclude from this result that a bulge in this
region may not be recognized directly by viral RNA polymerase but may serve as
a flexible joint between two more rigid structural elements that are involved in
immediate contact with viral polymerase. The necessary RNA bending may also,
but less efficiently, be achieved in an A·U-basepaired structure like pHL1140,
while the other two structures would not permit such type of interaction with RNA
polymerase. This interpretation has also been substantiated in a further series of
variations in this region.

Serial passaging of influenza virus carrying promoter mutants

All previous experiments consisted of a first measurement of viral mRNA
synthesis in DNA-transfected and infected B82 cells, followed by a second
measurement of viral mRNA synthesis in infected MDCK cells, after passaging of
progeny virus containing supernatants. CAT expression in infected cells upon viral
passaging requires packaging of pseudo-viral vRNAs, in addition to new rounds of
viral mRNA synthesis in those cells leading to CAT expression again. All viral
promoter mutants analysed and found active in transfected and helper-infected B82
cells also resulted in CAT expression after transfer, and consistently in equivalent
ratios of activity. Packaging, therefore, cannot be correlated with any specific
element in the vRNA promoter structure so far, and does not appear to be a
limiting factor in constructing influenza virus mutants in this system. While CAT
expression after passaging in general appeared to be increased over the levels
before passaging, this might have been simply the result of different cells being
used for the first and second step of CAT analysis, with MDCK being superior to
B82 cells in influenza mRNA synthesis and also in progeny yields. Therefore,
several experiments of serial passage have been performed using pHL1104-derived
influenza supernatants and others, in MDCK cells. In these serial passages, always
done using aliquots of supernatants harvested eight hours after infection for further
transfer, a stepwise increase of CAT expression is observed (Fig. 2). Apparently
the superior performance of viral RNA promoters carrying sequence deviations
according to pHL1104 is not only true for viral mRNA synthesis, but also for viral
RNA replication.

Therefore, mutant viral RNAs of this character become accumulated and 
effectively selected in further passaging, while packaging may be a neutral event 
in this regard, at least for the variants analysed here. 
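
The selection argument can be illustrated with a toy calculation. The sketch below assumes, purely for illustration, that a segment with a relative replication advantage r is enriched per passage according to f' = r·f / (r·f + 1 − f); neither this model nor the parameter values are derived from the experiments reported here, and the function name is hypothetical.

```python
# Toy model (ASSUMPTION, not from the experiments): a segment that replicates
# `advantage` times faster than its competitors is enriched at every passage.

def enrich(fraction: float, advantage: float, passages: int) -> list:
    """Return the fraction of the favoured segment before and after each passage."""
    history = [fraction]
    for _ in range(passages):
        fraction = advantage * fraction / (advantage * fraction + (1.0 - fraction))
        history.append(fraction)
    return history


# Hypothetical starting fraction and advantage, chosen only to show the trend.
for passage, value in enumerate(enrich(1e-4, 20.0, 5)):
    print(f"passage {passage}: fraction {value:.4f}")
```

In this model the odds f / (1 − f) are multiplied by r at each passage, so even a rare segment accumulates stepwise as long as packaging itself is neutral.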

Serial passaging extended 

During further passaging of supernatants the CAT-containing influenza segment
carrying the mutationally altered viral promoter sequences became accumulated in
a stepwise manner in the population of progeny viruses. In order to demonstrate
this effect on the level of individual viruses being transferred we isolated in three
independent experiments 50 to 85 plaques each after a third round of passage on
MDCK cells. Each cell lysate obtained for the individual plaques was assayed for
CAT activity according to the standard protocol. While in two of the experiments
the fraction of CAT-positive plaques was in the range of 4 to 8% (1 out of 50, 4
out of 40 plaques), in one of these series this fraction amounted to 47% (19 of 40
plaques). Both of these results demonstrate a substantial increase over the initial
fraction of CAT-segment containing virus, which may be calculated to be in the
range of 10"^ or at most 10"^, and slight variations in the conditions of growth
during three steps of transfer may suffice to produce the observed differences
in the fraction of CAT-positive plaques. While every CAT-positive plaque
demonstrates the amplification of nine (not eight) viral RNA segments present in the
initially infected cell, this may have resulted from a single virus carrying nine or more
RNA segments or from coinfection by two defective viruses able to complement
each other.






PCT/EP95/03663 



Necessarily, accumulation of a pseudo-viral segment not contributing to viral
growth will, in further steps, become lethal to viral growth, even if a majority of
virions may contain an average of eleven rather than eight RNA segments (Hsu et
al., 1987). Packaging of viral RNA segments based on a general packaging signal
identical for all eight segments and realized via a specific interaction chain (vRNA
panhandle structure - viral RNA polymerase - viral NP protein - viral M1 protein)
will reflect the pools of the various vRNA segments in infected cells, and
therefore may be biased towards an RNA segment superior in replication and
overrepresented in that pool. Biased replication and packaging will, however, lead
to accumulation of lethal viral particles due to an imbalance between the eight (or
nine) viral RNA segments. This prediction is borne out in continuing the viral
passage of pHL1104-derived influenza supernatants beyond step three as
exemplified in Fig. 2. While CAT expression based on transcription of the
pHL1104-derived pseudoviral RNA segments is increased further up to the fifth
passage, the number of viable viruses reaches a maximum already after the second
step of viral passaging, thus demonstrating the continuous accumulation of an
over-replicated foreign segment, based on a superior panhandle sequence.

At a stage representing the third or fourth passage as displayed in Fig. 2 a virus
preparation obtained in this way can be regarded as the equivalent of an attenuated
viral strain. While the concentration of attenuated virus particles that can be
achieved in this way may appear to be limited, a stage equivalent to passage 4 in
Fig. 2 may be delayed upon coinfection with wild-type helper virus during first or
second steps of transfer, and considerably increased concentrations of attenuated
virus preparations might be achievable in this way.

pHL1104-mediated high-rate expression of foreign proteins can also be used (after
two or more steps of amplification via serial passaging on MDCK cells) for high-rate
synthesis of foreign proteins in embryonated chicken eggs, following a
general method of preparation of viral stocks as used for influenza and other
viruses, i.e. injection of virus suspensions into the yolk sac. Protein preparations
isolated from those embryonated and infected cells will be glycosylated and
modified in other ways according to their origination from eukaryotic cells.

A second method of influenza virus attenuation has been achieved via cleavage of
either one of the influenza viral RNAs, preferably the M or NP gene (segments 7
or 5), via ribozyme hydrolysis in a specialised mode of action. The ribozyme
RNAs, which may be covalently inserted into the pSV2neo early mRNA, located
between the neomycin resistance gene and the small t intron sequence originating
from SV40 viral DNA, or expressed from similar expression cassettes, are directed
against the 5' end sequence of segment 7 (or another of the influenza vRNA
segments).

During initiation of mRNA synthesis the 5' terminal sequence which is involved in
formation of the panhandle structure is at first covered by viral RNA polymerase
in association with that double-stranded promoter region. It will, however, become
single-stranded and free of protein, since the polymerase molecule will start
transcription at the 3' end and move along the 3' template sequence while
synthesizing a perfectly hybridizing 5' RNA daughter strand, superior in that
regard to the parental panhandle 5' segment. Ribozyme RNAs, which may be
inhibited in their activity either by RNA substrates involved in double-strand
formation or if RNA substrates are covered by protein, have been directed with a
3' complementary sequence towards that protein-free 5' sequence of the substrate
vRNA molecule for initiation of hybridization, which then will be extended across
the entire complementary region of approximately 100 nucleotides, i.e. well into
the vRNA sequence initially covered by NP protein.

A second feature of the ribozyme RNAs as applied for inactivation of influenza
vRNA molecules is their double-unit hammerhead character, directed against not
one, but two close GUY cleavage sites, e. g. GUUi^ and GUU36 in segment 7 or 
GUC30 and GUC4g in segment 5, which are also known to be invariable in 
sequence comparisons of influenza isolates. 
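
To illustrate how candidate targets for such a double-unit ribozyme might be located, the sketch below scans an RNA sequence for GUY triplets (Y = C or U) and reports pairs of sites lying close together. The toy sequence, the function names, the distance threshold and the choice to number each site by the position of its G are all assumptions made for illustration; the segment 7 and segment 5 sequences themselves are not reproduced here.

```python
import re

# Sketch: find GUY triplets (Y = pyrimidine, C or U) and pairs of nearby sites.
# ASSUMPTION: sites are numbered (1-based) by the position of the G.


def find_guy_sites(rna: str) -> list:
    """Return 1-based start positions of every GUC or GUU triplet."""
    # The lookahead lets overlapping candidate positions be inspected one by one.
    return [m.start() + 1 for m in re.finditer(r"(?=GU[CU])", rna.upper())]


def close_site_pairs(sites: list, max_distance: int) -> list:
    """Return all pairs of sites separated by at most max_distance nucleotides."""
    return [
        (first, second)
        for i, first in enumerate(sites)
        for second in sites[i + 1:]
        if second - first <= max_distance
    ]


toy_rna = "ACGUCAAGGAGUUCAUGUAACCGUCA"  # hypothetical sequence, 5'->3'
sites = find_guy_sites(toy_rna)
print(sites, close_site_pairs(sites, max_distance=10))
```

A real search would additionally restrict candidates to triplets that are invariant across isolates, as stated above; that filter needs an alignment and is omitted from this sketch.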

Both features of anti-influenza ribozymes as pointed out contribute to a reduction
of typically two logs (up to three logs) in production of viral progeny in template
ribozyme DNA transfected cells as compared to infection of mock transfected
cells, both at moi 1 and 20 h after Lipofectamin-DNA treatment. Ribozyme
treatment can be applied after two or three rounds in MDCK cells of
pHL1104-promoted amplification of a pseudo-viral RNA segment originating from
RNA polymerase I transcription, in the presence of helper virus as used in initial
superinfection.

In a simple version ribozyme treatment as described above is employed as a
selection technique. Here, its application is appropriate if the pseudoviral RNA is
indeed a (foreign) influenza segment carrying particular mutations but capable in
principle of acting as a functional substitute for the helper viral segment destroyed by
ribozyme cleavage. For that purpose the substitute viral segment to be selected in
that procedure has to be mutagenized in advance at the two cleavage sites
indicated above in order to become resistant against ribozyme interference. In
another application ribozyme cleavage of helper virus vRNA can be used for
attenuation of recombinant influenza virus preparations. Here, the pseudo-viral
RNA segment may be designed in a way which renders it incapable of substituting
for a helper virus gene. Therefore, viral passaging into ribozyme template DNA
transfected cells would lead to an abortive infection only, because of
ribozyme-mediated destruction of an important viral gene, if its gene product were not
added for complementation via expression from a cDNA construct which is also
DNA-transfected into the cell together with ribozyme-expressing DNA 20 h before
viral infection. In this way viral progeny is obtained that is attenuated because of
ribozyme cleavage of one of the vRNA segments, and effectively that segment is
missing in the virions because it can no longer be packaged. Viral preparations
obtained in this way are capable of only one round of infection because of their
inherent M1 + M2 protein complementation, and therefore are suited for
vaccination purposes. Animal infection with progeny virus as isolated after the
ribozyme attenuation step results in abortive infection, but viral proteins
synthesized in infected cells are able to induce B-cell and T-cell responses in such
animals.

In influenza viral RNA synthesis parental negative-strand vRNA is copied into
plus-strand cRNA, which again is copied into progeny vRNA, from the first to the
last nucleotide. This amplification of viral RNAs, however, proceeds in an
inherently asymmetric way, since vRNA molecules are synthesized in excess over
cRNA molecules. This result is consistent with the idea that cRNA carries a
promoter structure more active in binding viral RNA polymerase and in initiation
of RNA synthesis, i.e. 'stronger' than does vRNA. While at first simply the two
3' ends of single-stranded vRNA and cRNA templates have been implicated as
promoter sequences, the detection of double-stranded panhandle structures
involving both ends of the vRNA sequence in virions (Hsu et al., 1987) suggested
more complicated substrates for RNA polymerase binding and initiation of
daughter-strand synthesis. A slightly different panhandle structure has also been
observed with model vRNA molecules in the absence of viral proteins in vitro
(Baudin et al., 1994), possibly calling for a structural change upon viral RNA
polymerase binding, i.e. a bulge may be shifted from position 4 to position 10 in
that reaction (see Fig. 1). While originally several of the RNA polymerase / vRNA
binding experiments in vitro appeared to show recognition only of 3' end
oligonucleotides, this has since been shown to be an artifact after pure,
recombinant viral polymerase free of residual RNA became available, instead of
enzyme preparations from virions. Under these conditions RNA polymerase
binding to viral RNA as well as endonucleolytic cleavage of cellular mRNAs by
subunit PB2 was observed to depend on vRNA 5' plus 3' terminal sequence
binding, with even higher affinity for the 5' non-template segment (Hagen et al.,
1994; Tiley et al., 1994).

In contrast to the use of both vRNA and cRNA promoter structures in
replication, physiologically only vRNA promoters also serve in initiation of
viral mRNA synthesis according to the cap-snatching mechanism (Plotch et al.,
1979; Braam et al., 1983). While it has been claimed that cRNA promoters would
not have the capacity to act according to this scheme (Tiley et al., 1994), the
failure to observe viral antisense mRNA molecules may simply reflect the
unavailability of cRNA molecules early in infection, i.e. in the absence of surplus
viral NP protein, and small amounts of such molecules might even have gone
undetected. In this invention we describe a mutational analysis of the vRNA
promoter structure in vivo which, in approaching the structure of the cRNA
promoter via three nucleotide exchanges, shows considerably improved activity in
viral mRNA synthesis over vRNA promoter wild-type levels. The continuing increase
of viral CAT mRNA expression during consecutive steps of viral passaging
suggests that the same vRNA promoter mutants also show increased activity in
cRNA synthesis. Both observations are in accordance with the idea that the cRNA
promoter structure might be "stronger" than the vRNA promoter, also in initiation
of viral mRNA synthesis.

Additional variations of the 5' terminal sequence clearly indicate the major
importance of a G residue in position 5, irrespective of whether or not it is
complementary to position 5 at the 3' end. The unique role of this G residue has
been observed before in a serial mutational analysis (Li and Palese, 1992). According
to both sets of data, guanosine residue 5 may be involved in single-strand binding of RNA
polymerase, as has indeed been observed for the non-template strand terminal
segment (Tiley et al., 1994). While panhandle double-strand structures are likely to
constitute the initial RNA polymerase binding substrate, a partial separation of
template and non-template strands is expected to take place subsequently, resulting
in a 'forked structure' such as proposed by Fodor et al. (1994). Specific and tight
binding of RNA polymerase in this structure may predominantly be oriented
towards sequence elements in the non-template strand, since the growing point of
RNA synthesis will have to move along the entire template strand following its
initiation. It is, therefore, possible that such a binding interaction survives most or
all of an individual round of mRNA synthesis, as has been proposed (Tiley et al.,
1994).

The triple nucleotide exchanges as introduced in vRNA molecules derived from
pHL1104 templates will create three additional basepairs able to stabilize the
panhandle structure in general, but more specifically they will favor a bulged
adenosine 10 over the bulged adenosine 4 conformation as observed for the wild-type
sequence in vitro (Baudin et al., 1994). Since the changes introduced here
lead to a considerable enhancement of promoter activity, we propose that a bulged
10 conformation may be the structure underlying the vRNA / polymerase binding
reaction, which otherwise would have to be achieved only as a result of that
interaction. A bulged adenosine 10 residue may constitute a kind of flexible joint
or angular kink, which in turn suggests two major, structurally stable binding sites
to the left and right of this element. One of these sites has to be the double-stranded
sequence element of (on average) six basepairs extending from positions
11 to 16 and 10 to 15, respectively. While the distal three basepairs are known
to be variable for the various RNA segments, basepair 13/12 has been shown to be
exchangeable experimentally, and also the number of basepairs has been reduced
to four without complete loss of function (Luo et al., 1991). With all of these data
it seems clear that the main recognition element in this region is an RNA double
strand of certain stability, while it remains possible that residue 12 guanosine and
potentially others are also recognized individually within that structure. A major
second binding element for RNA polymerase on the other side relative to position
10 is less evident, but may be located at a distance of nearly one helical turn in
the de-bulged region around position 4, since direct contacts are suggested by that
initial conformational interaction, and also by the specific requirement of a
guanosine residue in position 5, which is likely to interact not only during but
also before partial strand separation in that region, i.e. in the panhandle as well as
the forked structure. While an extra adenosine residue in position 10 may be
optimal for creating a correctly shaped bulge in this region of RNA, structural
variants are possible in this regard (see pHL1140), which excludes direct
interactions between RNA polymerase and residues constituting that bulge.
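The effect of the triple exchange on base pairing can be illustrated with a short Python sketch. It compares the terminal sequences given in the sequence listing (SEQ ID NO: 3, 4 and 5) and counts Watson-Crick pairs for an in-register alignment of positions 1 to 9. The reading directions of the listed sequences (SEQ ID NO: 3 and 4 read 3' to 5' from the vRNA 3' end, SEQ ID NO: 5 read 5' to 3' from the vRNA 5' end) are assumptions made for this illustration, and the function names are hypothetical. G-U wobble pairs and the bulged conformations are deliberately ignored, so this is only a rough picture of the panhandle, not a structure prediction.

```python
# Terminal sequences copied from the sequence listing.
# Assumption: SEQ ID NO: 3 and 4 are read 3'->5' from the vRNA 3' end,
# SEQ ID NO: 5 is read 5'->3' from the vRNA 5' end.
WT_3PRIME = "UCGUUUUCGUCCC"    # SEQ ID NO: 3, wild type
MUT_3PRIME = "UCAUCUUUGUCCC"   # SEQ ID NO: 4, pHL1104
FIVE_PRIME = "AGUAGAAACAAGGG"  # SEQ ID NO: 5

WATSON_CRICK = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}


def exchanges(wild_type, mutant):
    """List (position, wild-type base, mutant base) with 1-based positions."""
    return [
        (position, wt_base, mut_base)
        for position, (wt_base, mut_base) in enumerate(zip(wild_type, mutant), start=1)
        if wt_base != mut_base
    ]


def paired_positions(five_prime, three_prime, last_position=9):
    """Positions 1..last_position where both ends form a Watson-Crick pair
    when aligned in register (no bulge, wobble pairs not counted)."""
    return [
        position
        for position in range(1, last_position + 1)
        if (five_prime[position - 1], three_prime[position - 1]) in WATSON_CRICK
    ]


triple_mutation = exchanges(WT_3PRIME, MUT_3PRIME)
wt_pairs = paired_positions(FIVE_PRIME, WT_3PRIME)
mut_pairs = paired_positions(FIVE_PRIME, MUT_3PRIME)

print("exchanges:", triple_mutation)
print("wild-type Watson-Crick pairs:", wt_pairs)
print("pHL1104 Watson-Crick pairs:", mut_pairs)
```

Under these assumptions the three exchanged positions are exactly the positions that gain a Watson-Crick pair in the mutant, in line with the three additional basepairs described above.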

In summary we are proposing a model (see Fig. 1) of consecutive steps of 
interaction between a vRNA or cRNA promoter structure and viral RNA 
polymerase: 

bulged 4 panhandle -> bulged 10 panhandle / polymerase -> forked RNA /
polymerase (bound to 5G and ds element 11-16) -> initiation of RNA synthesis
(recognition of 3' end of template).

Attenuation of influenza viruses for preparation of a live nasal vaccine relies on
two mechanisms: 1) Preferential amplification of a recombinant viral segment
carrying the pHL1104 promoter mutation, which will increase its rate in packaged
viral RNP particles and indirectly decrease that of the eight helper virus RNP
particles. This competition results in an increase of defective viruses from which
one or more of the regular viral gene segments are missing. 2) Sequence-specific
ribozyme cleavage of one or more helper virus RNA segments, if compensated
through gene product expression for functional complementation. This dual
interaction will result in virus progeny which is capable of only one round of
infection, abortive because of the missing viral protein(s) that are required for their
propagation. Ribozyme cleavage of one out of two sister viral gene segments,
sensitive and (artificially) insensitive to its hydrolysis, may also be used
(repeatedly) for selection purposes, including selection for viral gene constructs
expressed via RNA polymerase I transcription.

MATERIALS AND METHODS 

Plasmid constructions 

Plasmids with mutated vRNA and/or mutated cRNA promoter sequences are
derivatives of pHL926 (Zobel et al., 1993; Neumann et al., 1994). In pHL926 a
hybrid CAT cDNA with flanking non-coding sequences derived from influenza
vRNA segments has been precisely inserted in antisense orientation between
mouse rDNA promoter and terminator sequences. The CAT reporter gene has
thus been introduced by exactly replacing the coding sequence for
hemagglutinin, retaining the untranslated viral 5' and 3' sequences of segment 4.

vRNA 5' end mutations were created by PCR, using a general primer hybridizing
to a position in the flanking rDNA promoter sequence, and a specific primer
carrying the desired nucleotide substitution to be introduced in the viral terminal
sequence. The polymerase chain reaction products were first digested with the
restriction enzymes BglII and SpeI, inserted into the left boundary position by
exchanging the segment between the corresponding restriction sites in pHL926, and
finally confirmed by Sanger DNA sequencing.
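As an illustration of the left-boundary exchange, the following Python sketch locates the BglII (AGATCT) and SpeI (ACTAGT) recognition sequences in a short excerpt of SEQ ID NO: 1 (positions 4141 to 4260 of the listing) and extracts the stretch lying between them. The excerpt boundaries and the helper name find_sites are choices made for this example only. The sketch reports where recognition sequences start, not the exact cleavage positions within them.

```python
# Excerpt of SEQ ID NO: 1 (pHL926), positions 4141-4260 of the listing.
EXCERPT_START = 4141
EXCERPT = (
    "GCCTGTCACTTTCCTCCCTGTCTCTTTTATGCTTGTGATCTTTTAGATCTGGTCCTATTG"
    "GACCTGGAGATAGGTAGTAGAAACAAGGGTGTTTTTAAATACTAGTACATTACGCCCCGC"
)

# Recognition sequences of the two enzymes used for the 5' end exchange.
RECOGNITION = {"BglII": "AGATCT", "SpeI": "ACTAGT"}


def find_sites(sequence, recognition, first_position=1):
    """Return the 1-based start positions of every occurrence of a
    recognition sequence, offset by the position of the first base."""
    positions = []
    index = sequence.find(recognition)
    while index != -1:
        positions.append(index + first_position)
        index = sequence.find(recognition, index + 1)
    return positions


site_map = {
    enzyme: find_sites(EXCERPT, recognition, EXCERPT_START)
    for enzyme, recognition in RECOGNITION.items()
}

# Stretch between the end of the BglII sequence and the start of the SpeI sequence.
bgl_end = site_map["BglII"][0] - EXCERPT_START + len(RECOGNITION["BglII"])
spe_start = site_map["SpeI"][0] - EXCERPT_START
between_sites = EXCERPT[bgl_end:spe_start]

print(site_map)
print(between_sites)
```

In this excerpt the stretch between the two recognition sequences contains AGTAGAAACAAGGG, the DNA form of the terminal sequence listed as SEQ ID NO: 5, which is the region a mutagenic primer for this boundary would have to cover.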

Generation of vRNA 3' end mutations followed the same general scheme at the
right boundary. PCR products were obtained by using a general primer
complementary to a CAT gene internal sequence position, and a specific primer
with appropriate nucleotide exchanges inserted into its sequence. Following
digestion with restriction enzymes NcoI and ScaI, the PCR products were cloned
into NcoI- and ScaI(partially)-digested plasmid pHL926. All PCR-derived
sequences were verified by DNA sequencing.

For constructs with both 5' end and 3' end mutations in combination, fragments
containing the 5' variation were obtained by BglII and SpeI restriction and inserted into
the appropriate 3' terminal variation plasmids.

Cells and viruses 

Influenza A/FPV/Bratislava viruses were grown in NIH3T3 cells. For transfection 
and passaging experiments B82 cells (a mouse L cell line) and MDCK cells were 
used. 

Lipofectamin DNA transfection and influenza virus helper infection

For DNA transfection 10^ B82 cells were used. 5 µg of plasmid DNA were mixed
with 60 µg of Lipofectamin (Lipofectamin™, GIBCO/BRL) in serum-free medium
and incubated at room temperature for 10-15 min. This mixture was added to the
cells, which had been washed twice with serum-free medium, and the incubation with
Lipofectamin/DNA was continued for 1 hr. After further incubation with DMEM
medium for 1 hr the transfected B82 cells were infected with influenza
A/FPV/Bratislava at a multiplicity of infection of 0.01 to 1 for another 30-60 min.
Further incubation was performed with DMEM medium.
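The multiplicity of infection range quoted above can be turned into virus amounts with a small Python sketch. The cell number used here (one million) is purely hypothetical, since it only serves to make the arithmetic concrete, and the Poisson estimate of the infected fraction is a standard textbook approximation rather than a measurement from this work.

```python
import math


def pfu_required(moi, cell_count):
    """Plaque forming units needed to reach a given multiplicity of infection."""
    return moi * cell_count


def fraction_infected(moi):
    """Poisson estimate of the fraction of cells that receive at least one
    infectious particle at the given multiplicity of infection."""
    return 1.0 - math.exp(-moi)


ASSUMED_CELL_COUNT = 1_000_000  # hypothetical value, for illustration only

for moi in (0.01, 0.1, 1.0):
    pfu = pfu_required(moi, ASSUMED_CELL_COUNT)
    infected = fraction_infected(moi)
    print(f"m.o.i. {moi:<4}: {pfu:>9.0f} pfu, about {infected:.1%} of cells infected")
```

At the low end of the range only a small fraction of the transfected cells is expected to receive helper virus, whereas at a multiplicity of 1 the estimate is roughly two thirds of the cells.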

Passaging of virus containing supernatants 

Under standard conditions, 8 hr after influenza infection (at m.o.i. 0.1 to 1) cells
were harvested for CAT assays, and supernatants were collected and spun down at
1200 rpm for 5 min for removal of cell debris.

Aliquots of virus-containing cleared supernatants were used for plaque tests, and
another aliquot was adsorbed to 10^ MDCK cells for 30-60 min for further
passaging. Again 8 hr after infection the CPE was verified, and cells and
supernatants were collected and treated as before.

CAT assay 

Cell extracts were prepared as described by Gorman et al. (1982). CAT assays were
done with [14C]chloramphenicol or fluorescent-labeled chloramphenicol
(borondipyrromethane difluoride fluorophore; FLASH CAT Kit, Stratagene) as
substrates.

For [14C]chloramphenicol the assay mixture contained: 0.1 µCi
[14C]chloramphenicol, 20 µl 4 mM Acetyl-CoA, 25 µl 1 M Tris-HCl (pH 7.5) and
50 µl of cell lysate in a total volume of 150 µl. The assay mixture for the
fluorescent-labeled substrate contained (in a final volume of 80 µl): 10 µl 0.25 M
Tris-HCl (pH 7.5), 10 µl 4 mM Acetyl-CoA, 10 µl fluorescent-labeled
chloramphenicol, and 50 µl of cell lysate. After an incubation time of 16 hr the
reaction products were separated by chromatography and either autoradiographed
or visualized by UV illumination and photography.
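The final concentrations in the two assay mixtures follow from the stock concentrations and volumes by simple dilution, as sketched below in Python. The sketch assumes that all volumes are in microlitres and that the Acetyl-CoA stock is 4 mM in both mixtures. The function and variable names are hypothetical.

```python
def final_concentration(stock_concentration, added_volume_ul, total_volume_ul):
    """Concentration after dilution, in the same unit as the stock."""
    return stock_concentration * added_volume_ul / total_volume_ul


# Fluorescent-substrate mixture: the four listed components, final volume 80 µl.
fluorescent_volumes_ul = {
    "Tris-HCl": 10,
    "Acetyl-CoA": 10,
    "labeled chloramphenicol": 10,
    "cell lysate": 50,
}
fluorescent_total_ul = sum(fluorescent_volumes_ul.values())

fluorescent_mM = {
    "Tris-HCl": final_concentration(250.0, 10, fluorescent_total_ul),  # 0.25 M stock
    "Acetyl-CoA": final_concentration(4.0, 10, fluorescent_total_ul),  # assumed 4 mM stock
}

# [14C]chloramphenicol mixture: total volume 150 µl.
radioactive_mM = {
    "Tris-HCl": final_concentration(1000.0, 25, 150),  # 1 M stock
    "Acetyl-CoA": final_concentration(4.0, 20, 150),   # 4 mM stock
}

print("fluorescent assay, total volume:", fluorescent_total_ul, "µl")
for name, value in fluorescent_mM.items():
    print(f"  {name}: {value:.2f} mM")
print("[14C] assay, total volume: 150 µl")
for name, value in radioactive_mM.items():
    print(f"  {name}: {value:.2f} mM")
```

The four listed components of the fluorescent mixture add up to the stated final volume, and under the stated assumption both mixtures end up with an Acetyl-CoA concentration of about 0.5 mM.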
LEGENDS TO FIGURES

Fig. 1. Proposed scheme of consecutive conformational steps occurring prior to
initiation of viral mRNA synthesis in influenza vRNA, in wildtype and pHL1104-derived
mutant sequences. Positions of the triple mutation in pHL1104 vRNA are
indicated in bold and larger size letters.

(A) Free RNA panhandle structure, bulged at position 4 (wildtype vRNA; Baudin
et al., 1994) or at position 10 (mutant vRNA). (B) Bulged 10 panhandle structures
after binding of viral RNA polymerase; proposed protein binding positions marked
by underlining. (C) Forked structures of partial strand separation. (D) Initiation
of viral mRNA synthesis via hybridization of capped primer oligonucleotide.

Fig. 2. Serial passaging of pHL1104-derived progeny viruses.
10^ B82 cells were transfected with 5 µg of pHL1104 DNA (in 60 µg
Lipofectamin) and infected two hours later with influenza A/FPV/Bratislava
(m.o.i. 1). 8 hr post-infection the cells were assayed for plaque forming units (dark
column) and for CAT activity (hatched column). After sedimentation another
aliquot of the supernatant was adsorbed to 10^ MDCK cells. Further rounds of
passaging were done equivalently by harvesting the cells 8 hr after infection for
assaying CAT activities, whereas an aliquot of the supernatant was always
adsorbed to fresh MDCK cells. Numbers of serial passages are indicated at the
bottom. Plaque forming units per ml refer to the left ordinate, CAT expression
rates (relative to primary infection levels) to the ordinate on the right.
SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT:

(A) NAME: Bayer AG

(B) STREET: Bayerwerk

(C) CITY: Leverkusen

(E) COUNTRY: Deutschland

(F) POSTAL CODE (ZIP): 51368

(G) TELEPHONE: (0)214-3061455

(H) TELEFAX: (0)214-303482

(ii) TITLE OF INVENTION: Vaccination virus, method of making it and
pharmaceutical composition comprising that virus

(iii) NUMBER OF SEQUENCES: 8

(iv) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30B (EPO)

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5241 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Influenza virus, RNA sequence

(C) INDIVIDUAL ISOLATE: pHL926

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

CTGGCGGCAG TAGCGCGGTG GTCCCACCTG ACCCCATGCC GAACTCAGAA GTGAAACGCC 60

GTAGCGCCGA TGGTAGTGTG GGGTCTCCCC ATGCGAGAGT AGGGAACTGC CAGGCATCAA 120

ATAAAACGAA AGGCTCAGTC GAAAGACTGG GCCTTTCGTT TTATCTGTTG TTTGTCGGTG 180

AACGCTCTCC TGAGTAGGAC AAATCCGCCG GGAGCGGATT TGAACGTTGC GAAGCAACGG 240

CCCGGAGGGT GGCGGGCAGG ACGCCCGCCA TAAACTGCCA GGCATCAAAT TAAGCAGAAG 300

GCCATCCTGA CGGATGGCCT TTTTGCGTTT CTACAAACTC TTTTGTTTAT TTTTCTAAAT 360

ACATTCAAAT ATGTATCCGC TCATGAGACA ATAACCCTGA TAAATGCTTC AATAATATTG 420

AAAAAGGAAG AGTATGAGTA TTCAACATTT CCGTGTCGCC CTTATTCCCT TTTTTGCGGC 480

ATTTTGCCTT CCTGTTTTTG CTCACCCAGA AACGCTGGTG AAAGTAAAAG ATGCTGAAGA 540

TCAGTTGGGT GCACGAGTGG GTTACATCGA ACTGGATCTC AACAGCGGTA AGATCCTTGA 600

GAGTTTTCGC CCCGAAGAAC GTTTTCCAAT GATGAGCACT TTTAAAGTTC TGCTATGTGG 660

CGCGGTATTA TCCCGTGTTG ACGCCGGGCA AGAGCAACTC GGTCGCCGCA TACACTATTC 720

TCAGAATGAC TTGGTTGAGT ACTCACCAGT CACAGAAAAG CATCTTACGG ATGGCATGAC 780

AGTAAGAGAA TTATGCAGTG CTGCCATAAC CATGAGTGAT AACACTGCGG CCAACTTACT 840

TCTGACAACG ATCGGAGGAC CGAAGGAGCT AACCGCTTTT TTGCACAACA TGGGGGATCA 900

TGTAACTCGC CTTGATCGTT GGGAACCGGA GCTGAATGAA GCCATACCAA ACGACGAGCG 960

TGACACCACG ATGCCTGCAG CAATGGCAAC AACGTTGCGC AAACTATTAA CTGGCGAACT 1020

ACTTACTCTA GCTTCCCGGC AACAATTAAT AGACTGGATG GAGGCGGATA AAGTTGCAGG 1080

ACCACTTCTG CGCTCGGCCC TTCCGGCTGG CTGGTTTATT GCTGATAAAT CTGGAGCCGG 1140

TGAGCGTGGG TCTCGCGGTA TCATTGCAGC ACTGGGGCCA GATGGTAAGC CCTCCCGTAT 1200

CGTAGTTATC TACACGACGG GGAGTCAGGC AACTATGGAT GAACGAAATA GACAGATCGC 1260

TGAGATAGGT GCCTCACTGA TTAAGCATTG GTAACTGTCA GACCAAGTTT ACTCATATAT 1320

ACTTTAGATT GATTTAAAAC TTCATTTTTA ATTTAAAAGG ATCTAGGTGA AGATCCTTTT 1380

TGATAATCTC ATGACCAAAA TCCCTTAACG TGAGTTTTCG TTCCACTGAG CGTCAGACCC 1440

CGTAGAAAAG ATCAAAGGAT CTTCTTGAGA TCCTTTTTTT CTGCGCGTAA TCTGCTGCTT 1500

GCAAACAAAA AAACCACCGC TACCAGCGGT GGTTTGTTTG CCGGATCAAG AGCTACCAAC 1560

TCTTTTTCCG AAGGTAACTG GCTTCAGCAG AGCGCAGATA CCAAATACTG TCCTTCTAGT 1620

GTAGCCGTAG TTAGGCCACC ACTTCAAGAA CTCTGTAGCA CCGCCTACAT ACCTCGCTCT 1680

GCTAATCCTG TTACCAGTGG CTGCTGCCAG TGGCGATAAG TCGTGTCTTA CCGGGTTGGA 1740

CTCAAGACGA TAGTTACCGG ATAAGGCGCA GCGGTCGGGC TGAACGGGGG GTTCGTGCAC 1800

ACAGCCCAGC TTGGAGCGAA CGACCTACAC CGAACTGAGA TACCTACAGC GTGAGCATTG 1860

AGAAAGCGCC ACGCTTCCCG AAGGGAGAAA GGCGGACAGG TATCCGGTAA GCGGCAGGGT 1920

CGGAACAGGA GAGCGCACGA GGGAGCTTCC AGGGGGAAAC GCCTGGTATC TTTATAGTCC 1980

TGTCGGGTTT CGCCACCTCT GACTTGAGCG TCGATTTTTG TGATGCTCGT CAGGGGGGCG 2040

GAGCCTATGG AAAAACGCCA GCAACGCGGC CCGAGATGCG CCGCGTGCGG CTGCTGGAGA 2100

TGGCGGACGC GATGGATATG TTCTGCCAAG GGTTGGTTTG CGCATTCACA GTTCTCCGCA 2160

AGAATTGATT GGCTCCAATT CTTGGAGTGG TGAATCCGTT AGCGAGGTGC CGCCGGCTTC 2220

CATTCAGGTC GAGGTGGCCC GGCTCCATGC ACCGCGACGC AACGCGGGGA GGCAGACAAG 2280

GTATAGGGCG GCGCCTACAA TCCATGCCAA CCCGTTCCAT GTGCTCGCCG AGGCGGCATA 2340

AATCGCCGTG ACGATCAGCG GTCCAGTGAT CGAAGTTAGG CTGGTAAGAG CCGCGAGCGA 2400

TCCTTGAAGC TGTCCCTGAT GGTCGTCATC TACCTGCCTG GACAGCATGG CCTGCAACGC 2460

GGGCATCCCG ATGCCGCCGG AAGCGAGAAG AATCATAATG GGGAAGGCCA TCCAGCCTCG 2520

CGTCGCGAAC GCCAGCAAGA CGTAGCCCAG CGCGTCGGCC AGCTTGCAAT TCGCGCTAAC 2580

TTACATTAAT TGCGTTGCGC TCACTGCCCG CTTTCCAGTC GGGAAACCTG TCGTGCCAGC 2640

TGCATTAATG AATCGGCCAA CGCGCGGGGA GAGGCGGTTT GCGTATTGGG CGCCAGGGTG 2700

GTTTTTCTTT TCACCAGTGA GACGGGCAAC AGCTGATTGC CCTTCACCGC CTGGCCCTGA 2760

GAGAGTTGCA GCAAGCGGTC CACGCTGGTT TGCCCCAGCA GGCGAAAATC CTGTTTGATG 2820

GTGGTTGACG GCGGGATATA ACATGAGCTG TCTTCGGTAT CGTCGTATCC CACTACCGAG 2880

ATATCCGCAC CAACGCGCAG CCCGGACTCG GTAATGGCGC GCATTGCGCC CAGCGCCATC 2940

TGATCGTTGG CAACCAGCAT CGCAGTGGGA ACGATGCCCT CATTCAGCAT TTGCATGGTT 3000

TGTTGAAAAC CGGACATGGC ACTCCAGTCG CCTTCCCGTT CCGCTATCGG CTGAATTTGA 3060

TTGCGAGTGA GATATTTATG CCAGCCAGCC AGACGCAGAC GCGCCGAGAC AGAACTTAAT 3120

GGGCCCCCGG TGACAGGGAC AGAGAGGGCT TCTGGAGGAA AAAAGAAAAA AAAAAAAAAA 3180

GATCCAAAGC TCCAGGGCGA GCTCGAATTC CCCGGTAAAG CCGCTTAAGA CATTCCCGCT 3240

CTTACACATC CCAGCCCTGA AAAAGGGCAT CAAAATAAAC CACACCTATG GTGTATGCAT 3300

TTACGTTGAC ACCATCGAAT GGTGCAAAAC CTTTCGCGGT ATGGCATGAT AGCGCCCGGA 3360

AGAGAGTCAA TTCAGGGTGG TGAATGTGAA ACCAGTAACG TTATACGATG TCGCAGAGTA 3420

TGCCGGTGTC TCTTATCAGA CCGTTTCCCG CGTGGTGAAC CAGGCCAGCC ACGTTTCTGC 3480

GAAAACGCGG GAAAAAGTGG AAGCGGCGAT GGCGGAGCTG AATTACATTC CCAACCGCGT 3540

GGCACAACAA CTGGCGGGCA AACAGTCGTT GCTGATTGGC GTTGCCACCT CCAGTCTGGC 3600

CCTGCACGCG CCGTCGCAAA TTGTCGCGGC GATTAAATCT CGCGCCGATC AACTGGGTGC 3660

CAGCGTGGTG GTGTCGATGG TAGAACGAAG CGGCGTCGAA GCCTGTAAAG CGGCGGTGCA 3720

CAATCTTCTC GCGCAACGCG TTCAGTGGGC TAGATCTACC GCGGTAGATC ATTAACTATC 3780

CGCTGGATGA CCAGGATGCC ATTGCTGTGG AAGCTGCCTG CACTAATGTT CCGGCGTTAT 3840

TTCTTGATGT CTCTGACCAG ACACCCATCA ACAGTATTAT TTTCTCCCAT GAAGACGGTA 3900

CGCGACTGGG CGTGGAGCAT CTGGTCGCAT TGGGTCACCA GCAAATCGCG CTGTTAGCGG 3960

GCCCTGTACG TCTGAGGCCG AGGGAAAGCT ATGGGCGCGG TTTTCTTTCA TTGACCTGTC 4020

GGTCTTATCA GTTCTCCGGG TTGTCAGGTC GACCAGTTGT TCCTTTGAGG TCCGGTTCTT 4080

TTCGTTATGG GGTCATTTTT GGGCCACCTC CCCAGGTATG ACTTCCAGGT ATTCTCTGTG 4140

GCCTGTCACT TTCCTCCCTG TCTCTTTTAT GCTTGTGATC TTTTAGATCT GGTCCTATTG 4200

GACCTGGAGA TAGGTAGTAG AAACAAGGGT GTTTTTAAAT ACTAGTACAT TACGCCCCGC 4260

CCTGCCACTC ATCGCAGTAC TGTTGTAATT CATTAAGCAT TCTGCCGACA TGGAAGCCAT 4320

CACAGACGGC ATGATGAACC TGAATCGCCA GCGGCATCAG CACCTTGTCG CCTTGCGTAT 4380

AATATTTGCC CATGGTGAAA ACGGGGGCGA AGAAGTTGTC CATATTGGCC ACGTTTAAAT 4440

CAAAACTGGT GAAACTCACC CAGGGATTGG CTGAGACGAA AAACATATTC TCAATAAACC 4500

CTTTAGGGAA ATAGGCCAGG TTTTCACCGT AACACGCCAC ATCTTGCGAA TATATGTGTA 4560

GAAACTGCCG GAAATCGTCG TGGTATTCAC TCCAGAGCGA TGAAAACGTT TCAGTTTGCT 4620

CATGGAAAAC GGTGTAACAA GGGTGAACAC TATCCCATAT CACCAGCTCA CCGTCTTTCA 4680

TTGCCATACG GAATTCCGGA TGAGCATTCA TCAGGCGGGC AAGAATGTGA ATAAAGGCCG 4740

GATAAAACTT GTGCTTATTT TTCTTTACGG TCTTTAAAAA GGCCGTAATA TCCAGCTGAA 4800

CGGTCTGGTT ATAGGTACAT TGAGCAACTG ACTGAAATGC CTCAAAATGT TCTTTACGAT 4860

GCCATTGGGA TATATCAACG GTGGTATATC CAGTGATTTT TTTCTCCATG ATTAATAGAA 4920

TTATCCCCTG TTTCTACTCC CCCCCAACTT CGGAGGTCGA CCAGTACTCC GGGCGAAACT 4980

TTGTTTTTTT TTTTTCCCCC GATGCTGGAG GTCGACCAGA TGTCCGAAAG TGTCCCCCCC 5040

CCCCCCCCCC CCCGGCGCGG AACGGCGGGG CCACTCTGGA CTCTTTTTTT TTTTTTTTTT 5100

TTTTTTTTTG GGGATCCTCT AGAGTCGACC TGCAGCCCAA GCTAGCGGCC GCTAGCTTCT 5160

GTTTTGGCGG ATGAGAGAAG ATTTTCAGCC TGATACAGAT TAAATCAGAA CGCAGAAGCG 5220

GTCTGATAAA ACAGAATTTG C 5241

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5241 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Influenza virus, RNA sequence

(C) INDIVIDUAL ISOLATE: pHL1104

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

CTGGCGGCAG TAGCGCGGTG GTCCCACCTG ACCCCATGCC GAACTCAGAA GTGAAACGCC 60

GTAGCGCCGA TGGTAGTGTG GGGTCTCCCC ATGCGAGAGT AGGGAACTGC CAGGCATCAA 120

ATAAAACGAA AGGCTCAGTC GAAAGACTGG GCCTTTCGTT TTATCTGTTG TTTGTCGGTG 180

AACGCTCTCC TGAGTAGGAC AAATCCGCCG GGAGCGGATT TGAACGTTGC GAAGCAACGG 240

CCCGGAGGGT GGCGGGCAGG ACGCCCGCCA TAAACTGCCA GGCATCAAAT TAAGCAGAAG 300

GCCATCCTGA CGGATGGCCT TTTTGCGTTT CTACAAACTC TTTTGTTTAT TTTTCTAAAT 360

ACATTCAAAT ATGTATCCGC TCATGAGACA ATAACCCTGA TAAATGCTTC AATAATATTG 420

AAAAAGGAAG AGTATGAGTA TTCAACATTT CCGTGTCGCC CTTATTCCCT TTTTTGCGGC 480

ATTTTGCCTT CCTGTTTTTG CTCACCCAGA AACGCTGGTG AAAGTAAAAG ATGCTGAAGA 540

TCAGTTGGGT GCACGAGTGG GTTACATCGA ACTGGATCTC AACAGCGGTA AGATCCTTGA 600

GAGTTTTCGC CCCGAAGAAC GTTTTCCAAT GATGAGCACT TTTAAAGTTC TGCTATGTGG 660

CGCGGTATTA TCCCGTGTTG ACGCCGGGCA AGAGCAACTC GGTCGCCGCA TACACTATTC 720

TCAGAATGAC TTGGTTGAGT ACTCACCAGT CACAGAAAAG CATCTTACGG ATGGCATGAC 780

AGTAAGAGAA TTATGCAGTG CTGCCATAAC CATGAGTGAT AACACTGCGG CCAACTTACT 840

TCTGACAACG ATCGGAGGAC CGAAGGAGCT AACCGCTTTT TTGCACAACA TGGGGGATCA 900

TGTAACTCGC CTTGATCGTT GGGAACCGGA GCTGAATGAA GCCATACCAA ACGACGAGCG 960

TGACACCACG ATGCCTTCAG CAATGGCAAC AACGTTGCGC AAACTATTAA CTGGCGAACT 1020

ACTTACTCTA GCTTCCCGGC AACAATTAAT AGACTGGATG GAGGCGGATA AAGTTGCAGG 1080

ACCACTTCTG CGCTCGGCCC TTCCGGCTGG CTGGTTTATT GCTGATAAAT CTGGAGCCGG 1140

TGAGCGTGGG TCTCGCGGTA TCATTGCAGC ACTGGGGCCA GATGGTAAGC CCTCCCGTAT 1200

CGTAGTTATC TACACGACGG GGAGTCAGGC AACTATGGAT GAACGAAATA GACAGATCGC 1260

TGAGATAGGT GCCTCACTGA TTAAGCATTG GTAACTGTCA GACCAAGTTT ACTCATATAT 1320

ACTTTAGATT GATTTAAAAC TTCATTTTTA ATTTAAAAGG ATCTAGGTGA AGATCCTTTT 1380

TGATAATCTC ATGACCAAAA TCCCTTAACG TGAGTTTTCG TTCCACTGAG CGTCAGACCC 1440

CGTAGAAAAG ATCAAAGGAT CTTCTTGAGA TCCTTTTTTT CTGCGCGTAA TCTGCTGCTT 1500

GCAAACAAAA AAACCACCGC TACCAGCGGT GGTTTGTTTG CCGGATCAAG AGCTACCAAC 1560

TCTTTTTCCG AAGGTAACTG GCTTCAGCAG AGCGCAGATA CCAAATACTG TCCTTCTAGT 1620

GTAGCCGTAG TTAGGCCACC ACTTCAAGAA CTCTGTAGCA CCGCCTACAT ACCTCGCTCT 1680

GCTAATCCTG TTACCAGTGG CTGCTGCCAG TGGCGATAAG TCGTGTCTTA CCGGGTTGGA 1740

CTCAAGACGA TAGTTACCGG ATAAGGCGCA GCGGTCGGGC TGAACGGGGG GTTCGTGCAC 1800

ACAGCCCAGC TTGGAGCGAA CGACCTACAC CGAACTGAGA TACCTACAGC GTGAGCATTG 1860

AGAAAGCGCC ACGCTTCCCG AAGGGAGAAA GGCGGACAGG TATCCGGTAA GCGGCAGGGT 1920

CGGAACAGGA GAGCGCACGA GGGAGCTTCC AGGGGGAAAC GCCTGGTATC TTTATAGTCC 1980

TGTCGGGTTT CGCCACCTCT GACTTGAGCG TCGATTTTTG TGATGCTCGT CAGGGGGGCG 2040

GAGCCTATGG AAAAACGCCA GCAACGCGGC CCGAGATGCG CCGCGTGCGG CTGCTGGAGA 2100

TGGCGGACGC GATGGATATG TTCTGCCAAG GGTTGGTTTG CGCATTCACA GTTCTCCGCA 2160

AGAATTGATT GGCTCCAATT CTTGGAGTGG TGAATCCGTT AGCGAGGTGC CGCCGGCTTC 2220

CATTCAGGTC GAGGTGGCCC GGCTCCATGC ACCGCGACGC AACGCGGGGA GGCAGACAAG 2280

GTATAGGGCG GCGCCTACAA TCCATGCCAA CCCGTTCCAT GTGCTCGCCG AGGCGGCATA 2340

AATCGCCGTG ACGATCAGCG GTCCAGTGAT CGAAGTTAGG CTGGTAAGAG CCGCGAGCGA 2400

TCCTTGAAGC TGTCCCTGAT GGTCGTCATC TACCTGCCTG GACAGCATGG CCTGCAACGC 2460

GGGCATCCCG ATGCCGCCGG AAGCGAGAAG AATCATAATG GGGAAGGCCA TCCAGCCTCG 2520

CGTCGCGAAC GCCAGCAAGA CGTAGCCCAG CGCGTCGGCC AGCTTGCAAT TCGCGCTAAC 2580

TTACATTAAT TGCGTTGCGC TCACTGCCCG CTTTCCAGTC GGGAAACGTG TCGTGCCAGC 2640

TGCATTAATG AATCGGCCAA CGCGCGGGGA GAGGCGGTTT GCGTATTGGG CGCCAGGGTG 2700

GTTTTTCTTT TCACCAGTGA GACGGGCAAC AGCTGATTGC CCTTCACCGC CTGGCCCTGA 2760

GAGAGTTGCA GCAAGCGGTC CACGCTGGTT TGCCCCAGCA GGCGAAAATC CTGTTTGATG 2820

GTGGTTGACG GCGGGATATA ACATGAGCTG TCTTCGGTAT CGTCGTATCC CACTACCGAG 2880

ATATCCGCAC CAACGCGCAG CCCGGACTCG GTAATGGCGC GCATTGCGCC CAGCGCCATC 2940

TGATCGTTGG CAACCAGCAT CGCAGTGGGA ACGATGCCCT CATTCAGCAT TTGCATGGTT 3000

TGTTGAAAAC CGGACATGGC ACTCCAGTCG CCTTCCCGTT CCGCTATCGG CTGAATTTGA 3060

TTGCGAGTGA GATATTTATG CCAGCCAGCC AGACGCAGAC GCGCCGAGAC AGAACTTAAT 3120

GGGCCCCCGG TGACAGGGAC AGAGAGGGCT TCTGGAGGAA AAAAGAAAAA AAAAAAAAAA 3180

GATCCAAAGC TCCAGGGCGA GCTCGAATTC CCCGGTAAAG CCGCTTAAGA CATTCCCGCT 3240

CTTACACATC CCAGCCCTGA AAAAGGGCAT CAAAATAAAC CACACCTATG GTGTATGCAT 3300

TTACGTTGAC ACCATCGAAT GGTGCAAAAC CTTTCGCGGT ATGGCATGAT AGCGCCCGGA 3360

AGAGAGTCAA TTCAGGGTGG TGAATGTGAA ACCAGTAACG TTATACGATG TCGCAGAGTA 3420

TGCCGGTGTC TCTTATCAGA CCGTTTCCCG CGTGGTGAAC CAGGCCAGCC ACGTTTCTGC 3480

GAAAACGCGG GAAAAAGTGG AAGCGGCGAT GGCGGAGCTG AATTACATTC CCAACCGCGT 3540

GGCACAACAA CTGGCGGGCA AACAGTCGTT GCTGATTGGC GTTGCCACCT CCAGTCTGGC 3600

CCTGCACGCG CCGTCGCAAA TTGTCGCGGC GATTAAATCT CGCGCCGATC AACTGGGTGC 3660

CAGCGTGGTG GTGTCGATGG TAGAACGAAG CGGCGTCGAA GCCTGTAAAG CGGCGGTGCA 3720

CAATCTTCTC GCGCAACGCG TTCAGTGGGC TAGATCTACC GCGGTAGATC ATTAACTATC 3780

CGCTGGATGA CCAGGATGCC ATTGCTGTGG AAGCTGCCTG CACTAATGTT CCGGCGTTAT 3840

TTCTTGATGT CTCTGACCAG ACACCCATCA ACAGTATTAT TTTCTCCCAT GAAGACGGTA 3900

CGCGACTGGG CGTGGAGCAT CTGGTCGCAT TGGGTCACCA GCAAATCGCG CTGTTAGCGG 3960

GCCCTGTACG TCTGAGGCCG AGGGAAAGCT ATGGGCGCGG TTTTCTTTCA TTGACCTGTC 4020

GGTCTTATCA GTTCTCCGGG TTGTCAGGTC GACCAGTTGT TCCTTTGAGG TCCGGTTCTT 4080

TTCGTTATGG GGTCATTTTT GGGCCACCTC CCCAGGTATG ACTTCCAGGT ATTCTCTGTG 4140

GCCTGTCACT TTCCTCCCTG TCTCTTTTAT GCTTGTGATC TTTTAGATCT GGTCCTATTG 4200

GACCTGGAGA TAGGTAGTAG AAACAAGGGT GTTTTTAAAT ACTAGTACAT TACGCCCCGC 4260

CCTGCCACTC ATCGCAGTAC TGTTGTAATT CATTAAGCAT TCTGCCGACA TGGAAGCCAT 4320

CACAGACGGC ATGATGAACC TGAATCGCCA GCGGCATCAG CACCTTGTCG CCTTGCGTAT 4380

AATATTTGCC CATGGTGAAA ACGGGGGCGA AGAAGTTGTC CATATTGGCC ACGTTTAAAT 4440

CAAAACTGGT GAAACTCACC CAGGGATTGG CTGAGACGAA AAACATATTC TCAATAAACC 4500

CTTTAGGGAA ATAGGCCAGG TTTTCACCGT AACACGCCAC ATCTTGCGAA TATATGTGTA 4560

GAAACTGCCG GAAATCGTCG TGGTATTCAC TCCAGAGCGA TGAAAACGTT TCAGTTTGCT 4620

CATGGAAAAC GGTGTAACAA GGGTGAACAC TATCCCATAT CACCAGCTCA CCGTCTTTCA 4680

TTGCCATACG GAATTCCGGA TGAGCATTCA TCAGGCGGGC AAGAATGTGA ATAAAGGCCG 4740

GATAAAACTT GTGCTTATTT TTCTTTACGG TCTTTAAAAA GGCCGTAATA TCCAGCTGAA 4800

CGGTCTGGTT ATAGGTACAT TGAGCAACTG ACTGAAATGC CTCAAAATGT TCTTTACGAT 4860

GCCATTGGGA TATATCAACG GTGGTATATC CAGTGATTTT TTTCTCCATG ATTAATAGAA 4920

TTATCCCCTG TTTCTACTCC CCCCCAACTT CGGAGGTCGA CCAGTACTCC GGGCGAAACT 4980

TTGTTTTTTT TTTTTCCCCC GATGCTGGAG GTCGACCAGA TGTCCGAAAG TGTCCCCCCC 5040

CCCCCCCCCC CCCGGCGCGG AACGGCGGGG CCACTCTGGA CTCTTTTTTT TTTTTTTTTT 5100

TTTTTTTTTG GGGATCCTCT AGAGTCGACC TGCAGCCCAA GCTAGCGGCC GCTAGCTTCT 5160

GTTTTGGCGG ATGAGAGAAG ATTTTCAGCC TGATACAGAT TAAATCAGAA CGCAGAAGCG 5220

GTCTGATAAA ACAGAATTTG C 5241

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: RNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Influenza virus, 3' RNA sequence

(C) INDIVIDUAL ISOLATE: Wild Type vRNA Promoter Element

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
UCGUUUUCGU CCC

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: RNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Influenza virus, vRNA 5' sequence

(C) INDIVIDUAL ISOLATE: pHL1104 vRNA Promoter Element

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:
UCAUCUUUGU CCC

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: RNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Influenza virus, cRNA 3' sequence

(C) INDIVIDUAL ISOLATE: cRNA Promoter Element

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
AGUAGAAACA AGGG

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6602 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Influenza virus, RNA sequence

(C) INDIVIDUAL ISOLATE: pHLllSl

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

CTAGACTCTT CAAGCAAAAG CAGGTAGATC TTGAAAGATG AGTCTTCTAA CCGAGGTCGA 60

AACGTACGTT CTCTCTATCA TCCCGTCAGG CCCCCTCAAA GCCGAGATCG CACAGAGACT 120

TGAAGATGTC TTTGCAGGGA AGAACACCGA TCTTGAGGTT CTCATGGAAT GGCTAAAGAC 180

AAGACCAATC CTGTCACCTC TGACTAAGGG GATTTTAGGA TTTGTGTTCA CGCTCACCGT 240

GCCCAGTGAG CGAGGACTGC AGCGTAGACG CTTTGTCCAA AATGCCCTTA ATGGGAACGG 300

GGATCCAAAT AACATGGACA AAGCAGTTAA ACTGTATAGG AAGCTCAAGA GGGAGATAAC 360

ATTCCATGGG GCCAAAGAAA TCTCACTCAG TTATTCTGCT GGTGCACTTG CCAGTTGTAT 420

GGGCCTCATA TACAACAGGA TGGGGGCTGT GACCACTGAA GTGGCATTTG GCCTGGTATG 480

TGCAACCTGT GAACAGATTG CTGACTCCCA GCATCGGTCT CATAGGCAAA TGGTGACAAC 540

AACCAACCCA CTAATCAGAC ATGAGAACAG AATGGTTTTA GCCAGCACTA CAGCTAAGGC 600

TATGGAGCAA ATGGCTGGAT CGAGTGAGCA AGCAGCAGAG GCCATGGAGG TTGCTAGTCA 660

GGCTAGGCAA ATGGTGCAAG CGATGAGAAC CATTGGGACT CATCCTAGCT CCAGTGCTGG 720

TCTGAAAAAT GCTCTTCTTG AAAATTTGCA GGCCTATCAG AAACGAATGG GGGTGCAGAT 780

GCAACGGTTC AAGTGATCCT CTCGCTATTG CCGCAAATAT CATTGGGATC TTGCACTTGA 840

TATTGTGGAT TCTTGATCGT CTTTTTTTCA AATGCATTTA CCGTCGCTTT AAATACGGAC 900

TGAAAGGAGG GCCTTCTACG GAAGGAGTGC CAAAGTCTAT GAGGGAAGAA TATCGAAAGG 960

AACAGCAGAG TGCTGTGGAT GCTGACGATG GTCATTTTGT CAGCATAGAG CTGGAGTAAA 1020

AAACTACCAT ATGGGGCATG TCCCAAGTAT GTTAAGCAAA ACACTCTGAA GTTGGCAACA 1080

GGGATGCGGA ATGTACCAGA GAAACAAACT AGAGGCATAT TCGGCGCAAT AGCAGGTTTC 1140

ATAGAAAATG GTTGGGAGGG AATGATAGAC GGTTGGTACG GTTTCAGGCA TCAAAATTCC 1200

GAGGGCACAG GACAAGCAGC AGATCTTAAA AGCACTCAAG CAGCCATCGA CCAAATCAAT 1260

GGGAAACTGA ATAGGGTAAT CGAGAAGACG AACGAGAAAT TCCATCAAAT CGAAAAGGAA 1320

TTCTCAGAAG TAGAAGGGAG AATTCAGGAC CTCGAGCATG CATCTAGAGG GCCCTATTCT 1380

ATAGTGTCAC CTAAATGCTA GAGCTCGCTG ATCAGCCTCG ACTGTGCCTT CTAGTTGCCA 1440

GCCATCTGTT GTTTGCCCCT CCCCCGTGCC TTCCTTGACC CTGGAAGGTG CCACTCCCAC 1500

TGTCCTTTCC TAATAAAATG AGGAAATTGC ATCGCATTGT CTGAGTAGGT GTCATTCTAT 1560

TCTGGGGGGT GGGGTGGGGC AGGACAGCAA GGGGGAGGAT TGGGAAGACA ATAGCAGGCA 1620

TGCTGGGGAT GCGGTGGGCT CTATGGCTTC TGAGGCGGAA AGAACCAGCT GGGGCTCTAG 1680

GGGGTATCCC CACGCGCCCT GTAGCGGCGC ATTAAGCGCG GCGGGTGTGG TGGTTACGCG 1740

CAGCGTGACC GCTACACTTG CCAGCGCCCT AGCGCCCGCT CCTTTCGCTT TCTTCCCTTC 1800

CTTTCTCGCC ACGTTCGCCG GCTTTCCCCG TCAAGCTCTA AATCGGGGCA TCCCTTTAGG 1860

GTTCCGATTT AGTGCTTTAC GGCACCTCGA CCCCAAAAAA CTTGATTAGG GTGATGGTTC 1920

ACGTAGTGGG CCATCGCCCT GATAGACGGT TTTTCGCCCT TTGACGTTGG AGTCCACGTT 1980

CTTTAATAGT GGACTCTTGT TCCAAACTGG AACAACACTC AACCCTATCT CGGTCTTATT 2040

CTTTTGATTT ATAAGGGATT TGGGGATTTC GGCCTATTGG TTAAAAAATG AGCTGATTTA 2100

ACAAAAATTT AACGCGAATT AATTCTGTGG AATGTGTGTC AGTTAGGGTG TGGAAAGTCC 2160

CCAGGCTCCC CAGGCAGGCA GAAGTATGCA AAGCATGCAT CTCAATTAGT CAGCAACCAG 2220

GTGTGGAAAG TCCCCAGGCT CCCCAGCAGG CAGAAGTATG CAAAGCATGC ATCTCAATTA 2280

GTCAGCAACC ATAGTCCCGC CCCTAACTCC GCCCATCCCG CCCCTAACTC CGCCCAGTTC 2340

CGCCCATTCT CCGCCCCATG GCTGACTAAT TTTTTTTATT TATGCAGAGG CCGAGGCCGC 2400

CTCTGCCTCT GAGCTATTCC AGAAGTAGTG AGGAGGCTTT TTTGGAGGCC TAGGCTTTTG 2460
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CAAAAAGCTC CCGGGAGCTT GTATATCCAT 
ATCGTTTCGC ATGATTGAAC AAGATGGATT 
GAGGCTATTC GGCTATGACT GGGCACAACA 
CCGGCTGTCA GCGCAGGGGC GCCCGGTTCT 
5 GAATGAACTG CAGGACGAGG CAGCGCGGCT 

CGCAGCTGTG CTCGACGTTG TCACTGAAGC 
GCCGGGGCAG GATCTCCTGT CATCTCACCT 
TGATGCAATG CGGCGGCTGC ATACGCTTGA 
G7VAACATCGC ATCGAGCGAG CACGTACTCG 
10 TCTGGACGAA GAGCATCA6G GGCTCGCGCC 

CATGCCCGAC GGCGAGGATC TCGTCGTGAC 
GGTG6AAAAT GGCCGCTTTT CTGGATTCAT 
CTATCAGGAC ATAGCGTTGG CTACCCGTGA 
TGACCGCTTC CTCGTGCTTT ACGGTATCGC 
15 TCGCCTTCTT GACGAGTTCT TCTGAGCGGG 

ACGCCCAACC TGCCATCACG AGATTTCGAT 
TTCGGAATCG TTTTCCGGGA CGCCGGCTGG 
GAGTTCTTCG CCCACCCCAA CTTGTTTATT 
AGCATCACAA ATTTCACAAA TAAAGCATTT 
20 AAACTCATCA ATGTATCTTA TCATGTCTGT 

TAATCATGGT CATAGCTGTX TCCTGTGTGA 
ATACGAGCCG GAAGCATAAA GTGTAAAGCC 
TTAATTGCGT TGCGCTCACT GCCCGCTTTC 
TAATGAATCG GCCAACGCGC GGGGAGA6GC 
25 TCGCTCACTG ACTCGCTGCG CTCGGTCGTT 

AAGGCGGTAA TACGGTTATC CACAGAATCA 
AAAGGCCAGC AAAAGGCCAG GAACCGTAAA 



TTTCGGATCT GATCAAGAGA CAGGATGAGG 2520 

GCACGCAGGT TCTCCGGCCG CTTGGGTGGA 2580 

GACAATCGGC TGCTCTGATG CCGCCGTGTT 2640 

TTTTGTCAAG ACCGACCTGT CCGGTGCCCT 27 00 

ATCGTG6CTG GCCACGACGG GCGTTCCTTG 2760 

GGGAAGGGAC TGGCTGCTAT T6GGCGAAGT 2820 

TGCTCCTGCC GAGAAAGTAT CCATCATGGC 2ft 80 

TCCGGCTACC TGCCCATTCG ACCACCAAGC 2940 

GATGGAAGCC GGTCTTGTCG ATCAGGATGA 3000 

AGCCGAACTG TTCGCCAGGC TCAAGGCGCG 3060 

CCATGGCGAT GCCTGCTTGC CGAATATCAT 3120 

CGACTGTGGC CGGCTGGGTG TGGCGGACCG 3180 

TATTGCTGAA GAGCTTGGCG GCGAATG6GC 324 0 

CGCTCCCGAT TCGCAGCGCA TCGCCTTCTA 3300 

ACTCTGGGGT TCGAAATGAC CGACCAAGCG 336 0 

TCCACCGCCG CCTTCTATGA AAGGTTGGGC 3420 

ATGATCCTCC AGCGCGGGGA TCTCATGCTG 3480 

GCAGCTTATA ATGGTTACAA ATAAAGCAAT 354 0 

TTTTCACTGC ATTCTAGTTG TGGTTTGTCC 3600 

ATACCGTCGA CCTCTAGCTA GAGCTTGGCG 366 0 

AATTGTTATC CGCTCACAAT TCCACACAAC 3720 

TGGGGTGCCT AATGAGTGAG CTAACTCACA 3780 

CAGTCGGGAA ACCTGTCGTG CCAGCTGCAT 3840 

GGTTTGCGTA TTGGGCGCTC TTCCGCTTCC 3900 

CGGCTGCGGC GAGCGGTATC AGCTCACTCA 396 0 

G6GGATAACG CAGGAAAGAA CATGTGAGCA 402 0 

AAGGCCGCGT TGCTGGCGTT TTTCCATAGG 408 0 
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CTCCGCCCCC CTGACGAGCA TCACAAAAAT 
ACAGGACTAT AAAGATACCA GGCGTTTCCC 
CCGACCCTGC C6CTTACC6G ATACCTGTCC 
TCTCAATGCT CACGCTGTAG GTATCTCAGT 
5 TGTGTGCAC6 AACCCCCCGT TCAGCCC6AC 

GAGTCCAACC CGGTAAGACA CGACTTATCG 
AGCAGAGCGA GGTATGTAGG CGGTGCTACA 
TACACTAGAA GGACAGTATT TGGTATCTGC 
AGAGTTGGTA GCTCTTGATC CGGCAAACAA 
10 TGCAA6CAGC AGATTACGCG CAGAAAAAAA 

ACGGGGTCTG ACGCTCAGTG GAACGAAAAC 
TCAAAAAGGA TCTTCACCTA GATCCTTTTA 
AGTATATATG AGTAAACTTG GTCTGACAGT 
TCAGCGATCT GTCTATTTCG TTCATCCATA 
15 ACGATACGGG AGGGCTTACC ATCTGGCCCC 

TCACCGGCTC CAGATTTATC A6CAATAAAC 
GGTCCTGCAA CTTTATCCGC CTCCATCCAG 
AGTAGTTCGC CAGTTAATAG TTTGCGCAAC 
TCACGCTCGT CGTTTGGTAT G6CTTCATTC 
20 ACATGATCCC CCATGTTGTG CAAAAAAGCG 

AGAAGTAAGT TGGCCGCAGT GTTATCACTC 
ACTGTCATGC CATCCGTAAG ATGCTTTTCT 
TGAGAATAGT GTATGCGGCG ACCGAGTTGC 
GCGCCACATA GCAGAACTTT AAAAGTGCTC 
25 CTCTCAAGGA TCTTACCGCT GTTGAGATCC 

TGATCTTCAG CATCTTTTAC TTTCACCAGC 
AATGCCGCAA AAAAGGGAAT AAGGGCGACA 



CGACGCTCAA GTCAGAGGTG GCGAAACCCG 414 0 

CCTGGAAGCT CCCTCGTGCG CTCTCCTGTT 4200 

GCCTTTCTCC CTTCGGGAAG CGTGGCGCTT 426 0 

TCGGTGTAGG TCGTTCGCTC CAAGCTGGGC 4320 

CGCTGCGCCT TATCCG6TAA CTATCGTCTT 438 0 

CCACTGGCAG CAGCCACTGG TAACAGGATT 444 0 

GAGTTCTTGA AGTGGTGGCC TAACTACGGC 4500 

GCTCTGCTGA AGCCAGTTAC CTTCGGAAAA 4560 

ACCACCGCTO GTAGCGGTGG TTTTTTTGTT 4620 

GGATCTCAAG AAGATCCTTT GATCTTTTCT 46 SO 

TCACGTTAAG GGATTTTGGT CATGAGATTA 474 0 

iUlTTAAAAAT GAAGTTTTAA ATCAATCTAA 4800 

TACCAATGCT TAATCAGTGA GGCACCTATC 4 860 

GTTGCCTGAC TCCCCGTCGT GTAGATAACT 4 920 

AGTGCTGCAA TGATACCGCG AGACCCACGC 4 980 

CAGCCAGCCG GAAGGGCC6A 6CGCAGAAGT 5040 

TCTATTAATT GTTGCCGGGA AGCTAGAGTA 5100 

GTTGTTGCCA TTGCTACAGG CATCGTGGTG 5160 

AGCTCCGGTT CCCAACGATC AAGGCGAGTT 5220 

GTTAGCTCCT TCGGTCCTCC GATCGTTGTC 5280 

ATGGTTATGG CAGCACTGCA TAATTCTCTT 534 0 

GTGACTGGTG AGTACTCAAC CAAGTCATTC 5400 

TCTTGCCCGG CGTCAATACG GGATAATACC 5460 

ATCATTGGAA AACGTTCTTC GGGGCGAAAA 5520 

AGTTCGATGT AACCCACTCG TGCACCCAAC 558 0 

GTTTCTGGGT GAGCAAAAAC AGGAA6GCAA 564 0 

CGGAAATGTT GAATACTCAT ACTCTTCCTT 57 OO 






TTTCAATATT ATTGAAGCAT TTATCAGGGT TATTGTCTCA TGAGCGGATA CATATTTGAA 5760

TGTATTTAGA AAAATAAACA AATAGGGGTT CCGCGCACAT TTCCCCGAAA AGTGCCACCT 5820

GACGTCGACG GATCGGGAGA TCTCCCGATC CCCTATGGTC GACTCTCAGT ACAATCTGCT 5880

CTGATGCCGC ATAGTTAAGC CAGTATCTGC TCCCTGCTTG TGTGTTGGAG GTCGCTGAGT 5940

AGTGCGCGAG CAAAATTTAA GCTACAACAA GGCAAGGCTT GACCGACAAT TGCATGAAGA 6000

ATCTGCTTAG GGTTAGGCGT TTTGCGCTGC TTCGCGATGT ACGGGCCAGA TATACGCGTT 6060

GACATTGATT ATTGACTAGT TATTAATAGT AATCAATTAC GGGGTCATTA GTTCATAGCC 6120

CATATATGGA GTTCCGCGTT ACATAACTTA CGGTAAATGG CCCGCCTGGC TGACCGCCCA 6180

ACGACCCCCG CCCATTGACG TCAATAATGA CGTATGTTCC CATAGTAACG CCAATAGGGA 6240

CTTTCCATTG ACGTCAATGG GTGGACTATT TACGGTAAAC TGCCCACTTG GCAGTACATC 6300

AAGTGTATCA TATGCCAAGT ACGCCCCCTA TTGACGTCAA TGACGGTAAA TGGCCCGCCT 6360

GGCATTATGC CCAGTACATG ACCTTATGGG ACTTTCCTAC TTGGCAGTAC ATCTACGTAT 6420

TAGTCATCGC TATTACCATG GTGATGCGGT TTTGGCAGTA CATCAATGGG CGTGGATAGC 6480

GGTTTGACTC ACGGGGATTT CCAAGTCTCC ACCCCATTGA CGTCAATGGG AGTTTGTTTT 6540

GGCACCAAAA TCAACGGGAC TTTCCAAAAT GTCGTAACAA CTCCGCCCCA TTGACGCAAA 6600

TGGGCGGTAG GCGTGTACGG TGGGAGGTCT ATATAAGCAG AGCTCTCTGG CTAACTAGAG 6660

AACCCACTGC TTACTGGCTT ATCGAAATTA ATACGACTCA CTATAGGGAG ACCCAAGCTT 6720

GGTACCGAGC TCGGATCCAC TAGTAACGGC CGCCAGTGTG CTGGAATTCT GCAGATTCTT 6780

CTCTCATCCG CCAAAACAGA AG 6802

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5825 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Influenza vir
(C) INDIVIDUAL ISOLATE: pHL1489

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

CGATGGTCAT TTTGTCCTGA TGAGTCCGTG AGGACGAAAC ATAGAGCTGG AGTAAAACTG 60

ATGAGTCCGT GAGGACGAAA CTACCTTGTT TCTATTCGAA ATGACCGACC AAGCGACGCC 120

CAACCTGCCA TCACGAGATT TCGATTCCAC CGCCGCCTTC TATGAAAGGT TGGGCTTCGG 180

AATCGTTTTC CGGGACGCCG GCTGGATGAT CCTCCAGCGC GGGGATCTCA TGCTGGAGTT 240

CTTCGCCCAC CCCGGGCTCG ATCCCCTCGC GAGTTGGTTC AGCTGCTGCC TGAGGCTGGA 300

CGACCTCGCG GAGTTCTACC GGCAGTGCAA ATCCGTCGGC ATCCAGGAAA CCAGCAGCGG 360

CTATCCGCGC ATCCATGCCC CCGAACTGCA GGAGTGGGGA GGCACGATGG CCGCTTTGGT 420

CCCGGATCTT TGTGAAGGAA CCTTACTTCT GTGGTGTGAC ATAATTGGAC AAACTACCTA 480

CAGAGATTTA AAGCTCTAAG GTAAATATAA AATTTTTAAG TGTATAATGT GTTAAACTAC 540

TGATTCTAAT TGTTTGTGTA TTTTAGATTC CAACCTATGG AACTGATGAA TGGGAGCAGT 600

GGTGGAATGC CTTTAATGAG GAAAACCTGT TTTGCTCAGA AGAAATGCCA TCTAGTGATG 660

ATGAGGCTAC TGCTGACTCT CAACATTCTA CTCCTCCAAA AAAGAAGAGA AAGGTAGAAG 720

ACCCCAAGGA CTTTCCTTCA GAATTGCTAA GTTTTTTGAG TCATGCTGTG TTTAGTAATA 780

GAACTCTTGC TTGCTTTGCT ATTTACACCA CAAAGGAAAA AGCTGCACTG CTATACAAGA 840

AAATTATGGA AAAATATTCT GTAACCTTTA TAAGTAGGCA TAACAGTTAT AATCATAACA 900

TACTGTTTTT TCTTACTCCA CACAGGCATA GAGTGTCTGC TATTAATAAC TATGCTCAAA 960

AATTGTGTAC CTTTAGCTTT TTAATTTGTA AAGGGGTTAA TAAGGAATAT TTGATGTATA 1020

GTGCCTTGAC TAGAGATCAT AATCAGCCAT ACCACATTTG TAGAGGTTTT ACTTGCTTTA 1080

AAAAACCTCC CACACCTCCC CCTGAACCTG AAACATAAAA TGAATGCAAT TGTTGTTGTT 1140

AACTTGTTTA TTGCAGCTTA TAATGGTTAC AAATAAAGCA ATAGCATCAC AAATTTCACA 1200

AATAAAGCAT TTTTTTCACT GCATTCTAGT TGTGGTTTGT CCAAACTCAT CAATGTATCT 1260

TATCATGTCT GGATCCCCAG GAAGCTCCTC TGTGTCCTCA TAAACCCTAA CCTCCTCTAC 1320

TTGAGAGGAC ATTCCAATCA TAGGCTGCCC ATCCACCCTC TGTGTCCTCC TGTTAATTAG 1380

GTCACTTAAC AAAAAGGAAA TTGGGTAGGG GTTTTTCACA GACCGCTTTC TAAGGGTAAT 1440

TTTAAAATAT CTGGGAAGTC CCTTCCACTG CTGTGTTCCA GAAGTGTTGG TAAACAGCCC 1500

ACAAATGTCA ACAGCAGAAA CATACAAGCT GTCAGCTTTG CACAAGGGCC CAACACCCTG 1560

CTCATCAAGA AGCACTGTGG TTGCTGTGTT AGTAATGTGC AAAACAGGAG GCACATTTTC 1620

CCCACCTGTG TAGGTTCCAA AATATCTAGT GTTTTCATTT TTACTTGGAT CAGGAACCCA 1680

GCACTCCACT GGATAAGCAT TATCCTTATC CAAAACAGCC TTGTGGTCAG TGTTCATCTG 1740

CTGACTGTCA ACTGTAGCAT TTTTTGGGGT TACAGTTTGA GCAGGATATT TGGTCCTGTA 1800

GTTTGCTAAC ACACCCTGCA GCTCCAAAGG TTCCCCACCA ACAGCAAAAA AATGAAAATT 1860

TGACCCTTGA ATGGGTTTTC CAGCACCATT TTCATGAGTT TTTTGTGTCC CTGAATGCAA 1920

GTTTAACATA GCAGTTACCC CAATAACCTC AGTTTTAACA GTAACAGCTT CCCACATCAA 1980

AATATTTCCA CAGGTTAAGT CCTCATTTAA ATTAGGCAAA GGAATTCTTG AAGACGAAAG 2040

GGCCTCGTGA TACGCCTATT TTTATAGGTT AATGTCATGA TAATAATGGT TTCTTAGACG 2100

TCAGGTGGCA CTTTTCGGGG AAATGTGCGC GGAACCCCTA TTTGTTTATT TTTCTAAATA 2160

CATTCAAATA TGTATCCGCT CATGAGACAA TAACCCTGAT AAATGCTTCA ATAATATTGA 2220

AAAAGGAAGA GTATGAGTAT TCAACATTTC CGTGTCGCCC TTATTCCCTT TTTTGCGGCA 2280

TTTTGCCTTC CTGTTTTTGC TCACCCAGAA ACGCTGGTGA AAGTAAAAGA TGCTGAAGAT 2340

CAGTTGGGTG CACGAGTGGG TTACATCGAA CTGGATCTCA ACAGCGGTAA GATCCTTGAG 2400

AGTTTTCGCC CCGAAGAACG TTTTCCAATG ATGAGCACTT TTAAAGTTCT GCTATGTGGC 2460

GCGGTATTAT CCCGTGTTGA CGCCGGGCAA GAGCAACTCG GTCGCCGCAT ACACTATTCT 2520

CAGAATGACT TGGTTGAGTA CTCACCAGTC ACAGAAAAGC ATCTTACGGA TGGCATGACA 2580

GTAAGAGAAT TATGCAGTGC TGCCATAACC ATGAGTGATA ACACTGCGGC CAACTTACTT 2640

CTGACAACGA TCGGAGGACC GAAGGAGCTA ACCGCTTTTT TGCACAACAT GGGGGATCAT 2700

GTAACTCGCC TTGATCGTTG GGAACCGGAG CTGAATGAAG CCATACCAAA CGACGAGCGT 2760

GACACCACGA TGCCTGCAGC AATGGCAACA ACGTTGCGCA AACTATTAAC TGGCGAACTA 2820

CTTACTCTAG CTTCCCGGCA ACAATTAATA GACTGGATGG AGGCGGATAA AGTTGCAGGA 2880

CCACTTCTGC GCTCGGCCCT TCCGGCTGGC TGGTTTATTG CTGATAAATC TGGAGCCGGT 2940

GAGCGTGGGT CTCGCGGTAT CATTGCAGCA CTGGGGCCAG ATGGTAAGCC CTCCCGTATC 3000

GTAGTTATCT ACACGACGGG GAGTCAGGCA ACTATGGATG AACGAAATAG ACAGATCGCT 3060

GAGATAGGTG CCTCACTGAT TAAGCATTGG TAACTGTCAG ACCAAGTTTA CTCATATATA 3120

CTTTAGATTG ATTTAAAACT TCATTTTTAA TTTAAAAGGA TCTAGGTGAA GATCCTTTTT 3180

GATAATCTCA TGACCAAAAT CCCTTAACGT GAGTTTTCGT TCCACTGAGC GTCAGACCCC 3240

GTAGAAAAGA TCAAAGGATC TTCTTGAGAT CCTTTTTTTC TGCGCGTAAT CTGCTGCTTG 3300

CAAACAAAAA AACCACCGCT ACCAGCGGTG GTTTGTTTGC CGGATCAAGA GCTACCAACT 3360

CTTTTTCCGA AGGTAACTGG CTTCAGCAGA GCGCAGATAC CAAATACTGT CCTTCTAGTG 3420

TAGCCGTAGT TAGGCCACCA CTTCAAGAAC TCTGTAGCAC CGCCTACATA CCTCGCTCTG 3480

CTAATCCTGT TACCAGTGGC TGCTGCCAGT GGCGATAAGT CGTGTCTTAC CGGGTTGGAC 3540

TCAAGACGAT AGTTACCGGA TAAGGCGCAG CGGTCGGGCT GAACGGGGGG TTCGTGCACA 3600

CAGCCCAGCT TGGAGCGAAC GACCTACACC GAACTGAGAT ACCTACAGCG TGAGCTATGA 3660

GAAAGCGCCA CGCTTCCCGA AGGGAGAAAG GCGGACAGGT ATCCGGTAAG CGGCAGGGTC 3720

GGAACAGGAG AGCGCACGAG GGAGCTTCCA GGGGGAAACG CCTGGTATCT TTATAGTCCT 3780

GTCGGGTTTC GCCACCTCTG ACTTGAGCGT CGATTTTTGT GATGCTCGTC AGGGGGGCGG 3840

AGCCTATGGA AAAACGCCAG CAACGCGGCC TTTTTACGGT TCCTGGCCTT TTGCTGGCCT 3900

TTTGCTCACA TGTTCTTTCC TGCGTTATCC CCTGATTCTG TGGATAACCG TATTACCGCC 3960

TTTGAGTGAG CTGATACCGC TCGCCGCAGC CGAACGACCG AGCGCAGCGA GTCAGTGAGC 4020

GAGGAAGCGG AAGAGCGCCT GATGCGGTAT TTTCTCCTTA CGCATCTGTG CGGTATTTCA 4080

CACCGCATAT GGTGCACTCT CAGTACAATC TGCTCTGATG CCGCATAGTT AAGCCAGTAT 4140

ACACTCCGCT ATCGCTACGT GACTGGGTCA TGGCTGCGCC CCGACACCCG CCAACACCCG 4200

CTGACGCGCC CTGACGGGCT TGTCTGCTCC CGGCATCCGC TTACAGACAA GCTGTGACCG 4260

TCTCCGGGAG CTGCATGTGT CAGAGGTTTT CACCGTCATC ACCGAAACGC GCGAGGCAGC 4320

TGTGGAATGT GTGTCAGTTA GGGTGTGGAA AGTCCCCAGG CTCCCCAGCA GGCAGAAGTA 4380

TGCAAAGCAT GCATCTCAAT TAGTCAGCAA CCAGGTGTGG AAAGTCCCCA GGCTCCCCAG 4440

CAGGCAGAAG TATGCAAAGC ATGCATCTCA ATTAGTCAGC AACCATAGTC CCGCCCCTAA 4500

CTCCGCCCAT CCCGCCCCTA ACTCCGCCCA GTTCCGCCCA TTCTCCGCCC CATGGCTGAC 4560

TAATTTTTTT TATTTATGCA GAGGCCGAGG CCGCCTCGGC CTCTGAGCTA TTCCAGAAGT 4620

AGTGAGGAGG CTTTTTTGGA GGCCTAGGCT TTTGCAAAAA GCTTCACGCT GCCGCAAGCA 4680

CTCAGGGCGC AAGGGCTGCT AAAGGAAGCG GAACACGTAG AAAGCCAGTC CGCAGAAACG 4740

GTGCTGACCC CGGATGAATG TCAGCTACTG GGCTATCTGG ACAAGGGAAA ACGCAAGCGC 4800

AAAGAGAAAG CAGGTAGCTT GCAGTGGGCT TACATGGCGA TAGCTAGACT GGGCGGTTTT 4860

ATGGACAGCA AGCGAACCGG AATTGCCAGC TGGGGCGCCC TCTGGTAAGG TTGGGAAGCC 4920

CTGCAAAGTA AACTGGATGG CTTTCTTGCC GCCAAGGATC TGATGGCGCA GGGGATCAAG 4980

ATCTGATCAA GAGACAGGAT GAGGATCGTT TCGCATGATT GAACAAGATG GATTGCACGC 5040

AGGTTCTCCG GCCGCTTGGG TGGAGAGGCT ATTCGGCTAT GACTGGGCAC AACAGACAAT 5100

CGGCTGCTCT GATGCCGCCG TGTTCCGGCT GTCAGCGCAG GGGCGCCCGG TTCTTTTTGT 5160

CAAGACCGAC CTGTCCGGTG CCCTGAATGA ACTGCAGGAC GAGGCAGCGC GGCTATCGTG 5220

GCTGGCCACG ACGGGCGTTC CTTGCGCAGC TGTGCTCGAC GTTGTCACTG AAGCGGGAAG 5280

GGACTGGCTG CTATTGGGCG AAGTGCCGGG GCAGGATCTC CTGTCATCTC ACCTTGCTCC 5340

TGCCGAGAAA GTATCCATCA TGGCTGATGC AATGCGGCGG CTGCATACGC TTGATCCGGC 5400

TACCTGCCCA TTCGACCACC AAGCGAAACA TCGCATCGAG CGAGCACGTA CTCGGATGGA 5460

AGCCGGTCTT GTCGATCAGG ATGATCTGGA CGAAGAGCAT CAGGGGCTCG CGCCAGCCGA 5520

ACTGTTCGCC AGGCTCAAGG CGCGCATGCC CGACGGCGAG GATCTCGTCG TGACCCATGG 5580

CGATGCCTGC TTGCCGAATA TCATGGTGGA AAATGGCCGC TTTTCTGGAT TCATCGACTG 5640

TGGCCGGCTG GGTGTGGCGG ACCGCTATCA GGACATAGCG TTGGCTACCC GTGATATTGC 5700

TGAAGAGCTT GGCGGCGAAT GGGCTGACCG CTTCCTCGTG CTTTACGGTA TCGCCGCTCC 5760

CGATTCGCAG CGCATCGCCT TCTATCGCCT TCTTGACGAG TTCTTCTGAG CGGGACTCTG 5820

GGGTT 5825
(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4023 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Influenza virus, RNA sequence
(C) INDIVIDUAL ISOLATE: pHL1490

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

CAGGTACATA TTGAAAGATG AGTCTTCTAA CCGAGGTCGA AACGTACGTT CTCTCTATCA 60

TCCCGTCAGG CCCCCTCAAA GCCGAGATCG CACAGAGACT TGAAGATGTC TTTGCAGGGA 120

AGAACACCGA TCTTGAGGTT CTCATGGAAT GGCTAAAGAC AAGACCAATC CTGTCACCTC 180

TGACTAAGGG GATTTTAGGA TTTGTGTTCA CGCTCACCGT GCCCAGTGAG CGAGGACTGC 240

AGCGTAGACG CTTTGTCCAA AATGCCCTTA ATGGGAACGG GGATCCAAAT AACATGGACA 300 

AAGCAGTTAA ACTGTATAGG AAGCTCAAGA GGGAGATAAC ATTCCATGGG GCCAAAGAAA 360 

TCTCACTCAG TTATTCTGCT GGTGCACTTG CCAGTTGTAT GGGCCTCATA TACAACAGGA 420 

TGGGGGCTGT GACCACTGAA GTGGCATTTG GCCTGGTATG TGCAACCTGT GAACAGATTG 480

CTGACTCCCA GCATCGGTCT CATAGGCAAA TGGTGACAAC AACCAACCCA CTAATCAGAC 540

ATGAGAACAG AATGGTTTTA GCCAGCACTA CAGCTAAGGC TATGGAGCAA ATGGCTGGAT 600

CGAGTGAGCA AGCAGCAGAG GCCATGGAGG TTGCTAGTCA GGCTAGGCAA ATGGTGCAAG 660

CGATGAGAAC CATTGGGACT CATCCTAGCT CCAGTGCTGG TCTGAAAAAT GCTCTTCTTG 720 

AAAATTTGCA GGCCTATCAG AAACGAATGG GGGTGCAGAT GCAACGGTTC AAGTGATCCT 780

CTCGCTATTG CCGCAAATAT CATTGGGATC TTGCACTTGA TATTGTGGAT TCTTGATCGT 840 

CTTTTTTTCA AATGCATTTA CCGTCGCTTT AAATACGGAC TGAAAGGAGG GCCTTCTACG 900 

GAAGGAGTGC CAAAGTCTAT GAGGGAAGAA TATCGAAAGG AACAGCAGAG TGCTGTGGAT 960 

GCTGACGATG GTCATTTTGT CAGTATAGAG CTGGAGTAAA AAAGTACCTT GTTTCTACTA 1020 

CCTATCTCCA GGTCCAATAG GACCAGATCT AAAAGATCAC AAGCATAAAA GAGACAGGGA 1080 

GGAAAGTGAC AGGCCACAGA GAATACCTGG AAGTCATACC TGGGGAGGTG GCCCAAAAAT 114 0 

GACCCCATAA CGAAAAGAAC CGGACCTCAA AGGAACAACT GGTCGACCTG ACAACCCGGA 1200 

GAACTGATAA GACCGACAGG TCAATGAAAG AAAACCGCGC CCATAGCTTT CCCTCGGCCT 1260

CAGACGTACA GGGCCCGCTA ACAGCGCGAT TTGCTGGTGA CCCAATGCGA CCAGATGCTC 1320

CACGCCCAGT CGCGTACCGT CTTCATGGGA GAAAATAATA CTGTTGATGG GTGTCTGGTC 1380

AGAGACATCA AGAAATAACG CCGGAACATT AGTGCAGGCA GCTTCCACAG CAATGGCATC 1440

CTGGTCATCC AGCGGATAGT TAATGATCTA CCGCGGTAGA TCTAGCCCAC TGAACGCGGG 1500

CGGCACCTCG CTAACGGATT CACCACTCCA AGAATTGGAG CCAATCAATT CTTGCGGAGA 1560

ACTGTGAATG CGCAAACCAA CCCTTGGCAG AACATATCCA TCGCGTCCGC CATCTCCAGC 1620

AGCCGCACGC GGCGCATCTC GGGCCGCGTT GCTGGCGTTT TTCCATAGGC TCCGCCCCCC 1680

TGACGAGCAT CACAAAAATC GACGCTCAAG TCAGAGGTGG CGAAACCCGA CAGGACTATA 1740

AAGATACCAG GCGTTTCCCC CTGGAAGCTC CCTCGTGCGC TCTCCTGTTC CGACCCTGCC 1800

GCTTACCGGA TACCTGTCCG CCTTTCTCCC TTCGGGAAGC GTGGCGCTTT CTCAATGCTC 1860

ACGCTGTAGG TATCTCAGTT CGGTGTAGGT CGTTCGCTCC AAGCTGGGCT GTGTGCACGA 1920

ACCCCCCGTT CAGCCCGACC GCTGCGCCTT ATCCGGTAAC TATCGTCTTG AGTCCAACCC 1980

GGTAAGACAC GACTTATCGC CACTGGCAGC AGCCACTGGT AACAGGATTA GCAGAGCGAG 2040

GTATGTAGGC GGTGCTACAG AGTTCTTGAA GTGGTGGCCT AACTACGGCT ACACTAGAAG 2100

GACAGTATTT GGTATCTGCG CTCTGCTGAA GCCAGTTACC TTCGGAAAAA GAGTTGGTAG 2160

CTCTTGATCC GGCAAACAAA CCACCGCTGG TAGCGGTGGT TTTTTTGTTT GCAAGCAGCA 2220

GATTACGCGC AGAAAAAAAG GATCTCAAGA AGATCCTTTG ATCTTTTCTA CGGGGTCTGA 2280

CGCTCAGTGG AACGAAAACT CACGTTAAGG GATTTTGGTC ATGAGATTAT CAAAAAGGAT 2340

CTTCACCTAG ATCCTTTTAA ATTAAAAATG AAGTTTTAAA TCAATCTAAA GTATATATGA 2400

GTAAACTTGG TCTGACAGTT ACCAATGCTT AATCAGTGAG GCACCTATCT CAGCGATCTG 2460

TCTATTTCGT TCATCCATAG TTGCCTGACT CCCCGTCGTG TAGATAACTA CGATACGGGA 2520

GGGCTTACCA TCTGGCCCCA GTGCTGCAAT GATACCGCGA GACCCACGCT CACCGGCTCC 2580

AGATTTATCA GCAATAAACC AGCCAGCCGG AAGGGCCGAG CGCAGAAGTG GTCCTGCAAC 2640

TTTATCCGCC TCCATCCAGT CTATTAATTG TTGCCGGGAA GCTAGAGTAA GTAGTTCGCC 2700

AGTTAATAGT TTGCGCAACG TTGTTGCCAT TGCTGCAGGC ATCGTGGTGT CACGCTCGTC 2760

GTTTGGTATG GCTTCATTCA GCTCCGGTTC CCAACGATCA AGGCGAGTTA CATGATCCCC 2820

CATGTTGTGC AAAAAAGCGG TTAGCTCCTT CGGTCCTCCG ATCGTTGTCA GAAGTAAGTT 2880

GGCCGCAGTG TTATCACTCA TGGTTATGGC AGCACTGCAT AATTCTCTTA CTGTCATGCC 2940

ATCCGTAAGA TGCTTTTCTG TGACTGGTGA GTACTCAACC AAGTCATTCT GAGAATAGTG 3000

TATGCGGCGA CCGAGTTGCT CTTGCCCGGC GTCAACACGG GATAATACCG CGCCACATAG 3060

CAGAACTTTA AAAGTGCTCA TCATTGGAAA ACGTTCTTCG GGGCGAAAAC TCTCAAGGAT 3120

CTTACCGCTG TTGAGATCCA GTTCGATGTA ACCCACTCGT GCACCCAACT GATCTTCAGC 3180

ATCTTTTACT TTCACCAGCG TTTCTGGGTG AGCAAAAACA GGAAGGCAAA ATGCCGCAAA 3240

AAAGGGAATA AGGGCGACAC GGAAATGTTG AATACTCATA CTCTTCCTTT TTCAATATTA 3300

TTGAAGCATT TATCAGGGTT ATTGTCTCAT GAGCGGATAC ATATTTGAAT GTATTTAGAA 3360

AAATAAACAA AAGAGTTTGT AGAAACGCAA AAAGGCCATC CGTCAGGATG GCCTTCTGCT 3420

TAATTTGATG CCTGGCAGTT TATGGCGGGC GTCCTGCCCG CCACCCTCCG GGCCGTTGCT 3480

TCGCAACGTT CAAATCCGCT CCCGGCGGAT TTGTCCTACT CAGGAGAGCG TTCACCGACA 3540

AACAACAGAT AAAACGAAAG GCCCAGTCTT TCGACTGAGC CTTTCGTTTT ATTTGATGCC 3600

TGGCAGTTCC CTACTCTCGC ATGGGGAGAC CCCACACTAC CATCGGCGCT ACGGCGTTTC 3660

ACTTCTGAGT TCGGCATGGG GTCAGGTGGG ACCACCGCGC TACTGCCGCC AGGCAAATTC 3720

TGTTTTATCA GACCGCTTCT GCGTTCTGAT TTAATCTGTA TCAGGCTGAA AATCTTCTCT 3780

CATCCGCCAA AACAGAAGCT AGCGGCCGCT AGCTTGGGCT GCAGGTCGAC TCTAGAGGAT 3840

CCCCAAAAAA AAAAAAAAAA AAAAAAAAAA GAGTCCAGAG TGGCCCCGCC GTTCCGCGCC 3900

GGGGGGGGGG GGGGGGGGGG ACACTTTCGG ACATCTGGTC GACCTCCAGC ATCGGGGGAA 3960

AAAAAAAAAA CAAAGTTTCG CCCGGAGTAC TCSGTCGACCT CCGAAGTTGG GGGGGAGTAG 4020

AAA 4023

CLAIMS:

1. A segmented RNA virus characterized in that it comprises one or more
segments which have been genetically modified to show improved
transcription, replication and/or expression rates and which may contain
non viral genes to be transferred to the host.

2. The virus of claim 1, wherein one or more modifications have been
introduced in the noncoding region(s) and/or in the coding region(s).

3. The virus of any of claims 1 or 2, wherein at least one modified segment is
derived from an original one by sequence variation(s).

4. The virus of any of claims 1 or 2, wherein at least one modified segment is
an artificial addition to the set of original segments.

5. The virus of claim 4, wherein the modified segment comprises a nucleotide
sequence which codes for a protein or peptide which is foreign to the
original virus.

6. The virus of claim 5, wherein the foreign protein or peptide constitutes an
antigen or antigen-like sequence, a T-cell epitope or related sequence.

7. The virus of any of claims 5 or 6, wherein the segment comprises
repetitions of an antigen or epitope or other peptide or protein.

8. The virus of any of claims 7 to 9, wherein the antigen or epitope is derived
from HIV, Herpes-Virus, Rhinovirus, CMV, papilloma viruses, Hepatitis
viruses and other human viruses or Hog Cholera Virus.

9. A pharmaceutical preparation of a vaccine comprising the virus of any of
claims 1 to 8.

10. Use of the virus of any of claims 1 to 8 for the preparation of
pharmaceuticals.